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r .Mn MtlCLEOTIDE FRAGMENTS ASSOCIATED WITH 

THERAPEUTIC PURPOSES 

ffA<i\ is a demyelinating 
Mnitinle sclerosis (MS) \s <x j 

"Multiple St. prevalence m 

T /«icoa«se of young adults wit" « f 
neurological disease of y y 20 and 200 per 

- --/"%r^ nractlirer cUnlca^^^ - ^ 
100,000. It ^^^^^.^ progressive course, 

relapsing/remitting o ...^bility. current knowledge 

.uggeats that important Influence and that 

15 ,eneti= bacK,rouna has an ^ ^^^^^^ 

..infectious" agent(.) »ay candidates but ae 

viruses have been proposed as possi .etiological 
yet, none of the- has been shown to play 

Many studies have supported the hypothesis of a 
" 7 .K. Htsease but none of the taiovnr. 

Viral aetiology J;^/;^ ^;,. <^„„, egent sought: a 
viruses tested has proved to 

„vie„ .tdK-X. .ohnson • 

-„ co„p.^ by y -"-^-r-n'th::: 

<„..vs and HXVS, was followed by great ^^^^^^^ 
ebillty to l^air the i:«une ^^st- "n^ " P 
" nr^vfTtr'^sririon With a Chronic 

30 rnriaJLy%e:yelinating diseae^ .n^h J">J-^^ 
extensive ---Jj-^J;, river. despite initial 
ri™ e ;es::oe%f H..v.i or HT.V.li.e retroviruses 
was not confirmed. 
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Recently, a retrovirus different from the known 
human retroviruses has been isolated in patients suffering 

from MS (3, 4, and 5). 

in 1989, the authors described the production of 
5 extracellular virions, associated with reverse 
transcriptase (RT) activity, by a culture of 
leptomeningeal cells (LM7) obtained from the cerebrospinal 
fluid of a patient with MS (3). This was followed by 
similar findings in monocyte cultures from a series of MS 
10 patients (5). Neither viral particles nor viral RT- 
activity were found in control individuals. Furthermore, 
the authors were able to transfer the LM7 virus to non- 
infected leptomeningeal cells in vitro (26). The molecular 
Characterization of the "LM7" retrovirus was a 
15 prerequisite for further evaluation of its possible role 
in MS. considerable difficulties arose from the absence of 
continuously productive retroviral cultures and from the 
low levels of expression in the few transient cultures. 
The strategy described here focused on RNA from 
20 extracellular virions, in order to avoid non-specific 
detection of cellular RNA and of endogenous elements from 
contaminating human DNA. A specific retroviral sequence 
associated with virions produced by cell cultures from 
several MS patients has been identified. The entire 
25 sequence of this novel retroviral genome is currently 
being obtained using RT-PCR on RNA from extracellular 
virions. The retrovirus previously called "LM7 virus- 
corresponds to an oncovirus and is now designated MSRV 
(Multiple sclerosis-associated Retrovirus) . 
30 The authors were also able to show that this 

retrovirus could be transmitted in vitro, that patients 
suffering from MS produced antibodies capable of 
recognizing proteins associated with the infection of 
leptomeningeal cells by this retrovirus, and that the 
35 expression of the latter could be strongly stimulated by 
the immediate-early genes of some herpesviruses (6). 
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All these results point to the role in MS of at 
least one unknown retrovirus or of a virus having reverse 
transcriptase activity which is detectable according to 
the method published by H. Perron (3) and qualified as 
..LM7-like RT" activity. The content of the publication 
identified by (3) is incorporated in the present 
description by reference. 

Recently, the Applicant's studies have enabled 
two continuous cell lines infected with natural isolates 
originating from two different patients suffering from MS 
to be obtained by a culture method as described in the 
document WO-A-93 / 20188 , the content of which is incorpor- 
ated in the present description by reference. These two 
lines, derived from human choroid plexus cells, designated 
15 LM7PC and PLI-2, were deposited with the ECACC on 
22nd July 1992 and 8th January 1993, respectively, under 
numbers 92072201 and 93010817, in accordance with the 
provisions of the Budapest Treaty. Moreover, the viral 
isolates possessing LM7-like RT activity were also 
20 deposited with the ECACC under the overall designation of 
"strains". The "strain" or isolate harboured by the PLI-2 
line, designated POL-2, was deposited with the ECACC on 
22nd July 1992 under No. V92072202. The "strain" or 
isolate harboured by the LM7PC line, designated MS7PG, was 
25 deposited with the ECACC on 8th January 1993 under 

No. V93010816. 

Starting from the cultures and isolates 
mentioned above, characterized by biological and 
morphological criteria, the next step was to endeavour to 
30 characterize the nucleic acid material associated with the 
viral particles produced in these cultures. 

The portions of the genome which have already 
been characterized have been used to develop tests for 
molecular detection of the viral genome and 
immunoserological tests, using the amino acid sequences 
encoded by the nucleotide sequences of the viral genome, 
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. t= detect the im^ne respone. dlr.oted .g.ln.t 
m order to detect tn infection and/or viral 

epitopes associated with the infect! 

e.presslon^ ^^^^ ^^^^^^^ .sso^^^i^ 

, ,o he confine .«-en MS -—K^evS 
sequences identified in ' Is related to 

the viral system discovered by the Appl ^„ 

3 c»ple. r ^e el: u^ar viral'partlcles 

^ ,ound "■'7;/,;;„rUtres of cells of patients 
" rrr:r%r - de^ly th. there - 

coencapsidation Of ^^^^ r^:::^ .enome .hlch 
hut. differ«.t from the w yp^^^^^^^^ This phenomenon 

r.r::» --- het-een repucatlve — in.s.s an. 
" rn^cenous retroviruses helon,in, to «>e same f»Uy or 
even heterologous retroviruses The """""^^ 
retroviruses is very '»^-""\ \^^..°°T has heen 
discovery since, in the "/^^ .,„p,iein, 

.0 Observed that "™',enol exist in normal 

ences ^ZIL retroviral elements 

human D»A. ffl.e „^ their genome 

„«v, related to ^ the MSRV-1 

explains the fact th i„teraot with closely 

25 retrovirus in human cells is able t ^ 

PaZ^nic" a„d/or infectious 
found in the case o p ^cotropic 
endogenous retroviruses (for exaBpl 
» • . „f the murine leukaemia virus) , ana in me 
strains of ^^^^^ nucleotide sequence may be 

30 exogenous retroviruses »h 

found partially or wholly, ^ tu^r 
host animars genome (e.g. ^""^'^^'^ interactions 
Virus transmitted via - ^^^^Ir coactivation 

consist mainly of (1) a «. .^^ ,U) and 

,5 cf ERVs by the "Pl^";^^^^ „,,ted to ERVS, or 
"illegitimate- encapsidation of FMAs reia 
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n of cellular RNAs - simply possessing 

of ERVs - or even of .„ ^he retroviral 

co.pa.i..e ^^^^^^X^:,/^^^^^ Of tne replicative 

particles produced by the ^ .^^^ ,,„eti«es 

strain, which are ^^^^ ^^^^ 

.i.h a P^-»^°^-^":^;, ,:::;en the coencapsidatea 

substantial recombxnat.ons ^^^^^^ ^^^^^^^ 

genomes, in P^'^^'^^'fJ ^he formation of hybrid 

^ription sometimes transmissible and sometimes 

genomes, wnicn ai.^ 

with a P-'>°^-^=;7;'„'rrt°:^en=es rel.tea to MSBV-. 

Thus, (X O'"''^' J ,,,,1 particles; (ii) 
,ave been """^ ^e au/erent regions of the MS«V-1 

„„Xecul« ^ , by systematically 

.etrcviral ,e„o»e mte.ferin, and/or 

analy^in? the =oen P ^ Infection 

.eccBbinea sequences "^-^ ^^J^^^,,,. clones may 

ana/or expression "^^J' „^ hy the 

have detective sequence P „ 

-"°-"\-^^r:rrlvre -nrcrlptase, ,111, the 
, „£ transcription of ^e r „trovlral 
families ot sequences related aiagnostlo 

genomic --J-^r^V: ^e Identification of 
detection which may " expressed, and by the 

invariable regions among the clo^^^ ^^^^^^^^^ 

, identification of read g ^^^^^^^^ polypeptides 

production of antigen „«rtion or even by just 

which may be produced only oy v 

these conditions/ 

one, of the clones e-^^'-^-^f ;^^„,3 .^..esed In the 
the systematic analysis „equency of variation 

,0 region of a ,iven gene J^^^,,^!, ,,„ome In this 

and,or of "-"f "f^-J^roptimal sequences for the 
region to be -luated -d the p^^^ .^..^oatlons, to be 

applications ^" J"'^^^ ^,„,^ y., a retrovirus such as 

defined, (iv) '^"^.""^ feet of its expression and of the 
35 HSRV-1 may be a direct effect 

proteins or peptides produced as 
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thus, th«. genome, associated wi 
, and/or infection by HSRV-l are an .nte,ra P^^ ^^^^ 

potential "'"""^"'f^J^^^l.ir detection and .pedal 
constitute means associated with 

therapeutic "^^^ J^^^^^^^..^^ Lponslhle for the 

or cofactor of these ini. gliotoxic 

" -----^^^r^eri^d^l tr patent application 
factor Which are a.s^.r> p,,,ielpate In the 

published -"-■^ .ery effective strategy for 

development of an overa monitoring and/or 

the diagnosis, prognosis, '"^"PJ^ .„y 
integrated therapy =f J - ^.^f ^r agent.. 

"T%rr::r a 'trallel discovery has heen 

.ade m aether autoimmune /r^teUTa:::: 
,^>, Which has been ^-^^'^^^'-..t ,,„overy show. 
,0 application filed ^ "^J^^.^..^ similar to the 

that, by applying ^^^l^'^'^^^.Z^., on MS. it wa. 

ones Which were used in '"JJ ^ «^ich 

possible to Identify . retr ^^^^ 
,,,«s the seguences ^^^^^ ,^,„.„„, 

" in HS r. regards M»V-1. the seguences detected 

aescribed n HS As -J genes. In 

In common in MS and R& reia possible to 

•.4- efate of knowledge, it is 
the current state oi a.sorlbed with the 

associate the gag -reTwr.iseases. 
3„ «3.v-l strains expressed n^thes ^^^^^^^^^^ 

u,..h are additional to those already 
,,,,ous results J"^ ,,„„t Applications, 

"""°':l«ar Of 03.0..1»«. published under 

- No. 92/04322 
35 NO. 2,689,519; 
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- NO. 92/13447 
NO- 2,689,521; 

- NO- 92/13443 
NO- 2,689,520; 

- NO. 94/01529 
NO. 2,715,936; 

- NO. 94/01531 
NO. 2,715,939; 

- NO. 94/01530 
NO. 2,715,936; 

- No. 94/01532 
NO. 2,715,937; 

- No. 94/14322 



of 03.11.1992, 

of 03.11.1992, 

of 04.02.1994, 

of 04.02.1994, 

of 04.02.1994, 

of 04.02.1994, 

of 24.11-1994, 



15 



of 23.12.1994 



published 
published 
published 
published 
published 
published 
published 
; published 



under 



under 



under 



under 



under 



under 



under 



under 



20 



25 



30 



35 



No. 2,727,428; 
- and No. 94/15810 

'■'"'■Z present invention relates, in the first 
„la=e to . viral ^teri.l. in the isolated or purxfxed 
state', ""i* -V "-'"^"^ =har.cter»ea xn 

dif,«rent -V^" „„cleotide sequence chosen fro. 

- its genome coraprises a uu^x 

"o". ; ^ir co»ple.entary sequences ana their equivalent 
N0.89, their J nucleotide sequences displaying, 

sequences, m particular monomers, at least 

any -^"^ tZr^^o..lo,y with the said 

Holao SEQ ID »o:el, SEO 10 »0=8,, respectively, and their 

"T^ZZ TZZ^ co.prisin, the env and pel 
-the «,e ,a, qene, excluding the 

on havinq a sequence identical or equivalent to 
r:r::=l <::^es for ly polypeptide dlsplaymq, for any 
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at least 30 amino acids, at least 
.onUguous ^„„olo,v with . peptide 

,„d ''"'-•">^/*,„JT^,,eotid. .e^e„=e chosen fro. 
sequence encoded by " „ 

:::::: To ro;:o's«> xo »o=ex sko xo .0., ^d ^..r 

complementary „„cleotlde sequence partially 

^TotHlV" or equivalent to XO »o=S, or 

^\:r;lq:rarser:°::;ieotlde sequence partially 

- the gag « equivalent to SEQ ID »0:88. 

or totally identical or eM present 
AS indicated above, according to the pr 

. ^.^r viral material as defined above is 

invention the viral ^^^^^^ ^ 

- nr:eror-»s. . and m^e esp«..lly to the 

i— also relates to a nucleic 
" material, t L isolated or purified state, having at 
least one of the following definitions • 

-i^ir. material comprising a nucieotioe ^ 

" ' n from the group including sequences SEQ ID KO:93, 
selected from the grouP ^^^^^^ ^^.^^^^^ 

25 SEQ ID NO:94, SEQ ID N0.169, SEQ ^^^^^^^ 

SEQ ID NO: 176, 

SEQ ID NO; 172, « complementary 

SEQ ID no: 178, SEQ ID NO: 179, their c P 

o «nd their equivalent sequences, in particuia 
sequences and tneir «„rcession of 100 
nucleotide sequences least 
30 contiguous monomers at least « ^ 

rr'sEO r»0.iet XO HO=170. XO »0=X71. 

SEaXDH0:172, ^ ^ j^elr complementary 

SEO XD »o:i7S. ^^/"/^^t, '.or ERV-„ ; advantageously, 
35 sequences, excluding HSERV 9 ^ 
the nucleotide sequence of said 
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SEQ ID NO:94, SEQ ID N0.169 SEQ 

<5EO ID NO: 17 6, 

SEQ ID no: 172, J''^'" complementary 

3E.ID.0:l.S ..,:ies, in particular 

5 sequences and thexr ^^"^ ^ succession of 100 

nucleotide sec^ences ^^^^^JJ^^^^^^ .^.efera.ly at least 
contiguous monomers at 1 ^^^^^^^es SEQ ID NO:93, 

80% homology "^^^^ j^o.^^O, SEQ ID N0:171, 

SEQ ID NO:94, SEQ ID NO. 169, SEQ ^^^^^^^ 

SEQ ID NO: 176, 

10 SEQ ID no: 172, complementary 
SEQIDN0:178, SEQlDNO:l79, and 

sequences ; ^ . ^ . isolated or purified state, 

_ a nucleic material, j.« . , •„„ for any contiguous 
coalng tor any polypeptide "-P^^^-^J J/"^ 
^4- ipast 30 amino acias, 
15 .ucoession of at lea=t f^„bly at Last 70* 

preferably at J^^ .y any nucleotiae 

homology "1th a peptide i.„eludin, SEQ ID »0=93, 

«,uen=e selected from the ,rc^P 

" :SlDr=i«: SEQID»0=1V. .nd their oo.ple.ent.ry 

eequencesj isolated or purified state, 

. a nucleic material, ^ nucleotide sequence 

Of retroviral type, -JJ'^;^^ poi qene of 

25 identical or similar " multiple sclerosis 

„ isolated ."";;^;;\;;\rnta,eously, said nucleotide 
re,::rr"o niuar'to said at least part of the ,ene 
poi? •^ir.r, a nucleotide sequence 

" - ' sTrartorTeastU Of the pol ,en of an 

identical or ""^l" ^ transcriptase having a 

isolated Virus .mine acid domains 

•rt.crrr of a refere.. 
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sequence according to UPGM «ee option of the oeneworks™ 

.U. t.e .ino .i. .o.ein.. 

nucleotide fragments each comprising a nucieotlae seguence 
chosen from the group Incluain,! 

,e, all the genomic seguencee, partial and total, of the 
Icl gine of the MSRV-1 virus, except for the total 
.0 TegueH of the nucleotide frag^ht defined hy 

genomic s«,uenc.s, partial and total, of the 

env gene of MSRV-l; 

Z !ll the partial genomic sequences of the gag g.n. of 

" Tlu the genomic sequences overlapping the pol gene and 
Z env gene of the MSRV-1 virus, and overlapping the pol 
gene and the gag gene, 

(.1 all the sequences, partial and total, oi 

f.om the group including the clones FBd3 
20 chosen from tne gxu"^ TTRni 

t pel (SEQ ID NO:51), JLBd 

(SEQ ID NO:46), ^ pox v 

« =ox TTBC2 (SEQ ID NO: 53) and GM3 

<s----;^ 'rcirde""seq:re 

" rntrcar=::'or rr^tne sequence defined . 

SEQ ID NO:l; ^^^.^^v to the said genomic sequences; 

(f) sequences complementary to cne s y „ to /-^ 

^ • the said sequences (a) to (e) , 

(q) sequences equivalent to the saia ^ 
,0 Tn particular nucleotide — s -splav^ng for an. 
^-F inn contiquous monomers, j.ccii=. 
:::::::::: le"o.\omoloqv with the said sequences 

^'^iLfthat this nucleotide fragment does not comprise 
provided that this described in LA MANTIA 

35 or consist of the sequence ERV 9 as ae 

et al. (18)- 
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«e «r» genomic sequences, partial « 

..,„-l.tea by ooenoapsidation or By 
includes all sequences associatea oy 

coexpression, or reco»bined '^^"'^l'.^^^, 

preferably, such a fragment =°»>P"=-^- ^ 

. either a nucleotide of C«S«V-1 

total ,eno„lc .e^ence °^ the „„,,,„,,ae 
virus, except for the tot , identical to any 

..e^ent -^^"\^J\:^J"3a a partial or total ,eno»ic 
IZ^ZrTXJLr^''' is ho«lo,ous to the 

nucleotide sequence -"cal to partial «t«^^^ 

,eno»i= sequence of the env qene " "^^^ ^. „^ 

laentical to any any sequence 

„„cleotiae serenes or ^^^^^^^^ 

equivalent to the saia 

on. Which is ho.oloqous to latter^ ^^^^^^^ ^ 

in particular, the in ^ucleotiae 

„„cleotiae totally ialtical to a 

„ sequence which is P>-^""^ including: 

nucleotide sequence ""^.eTb^^^ „ „o=40, SEQ ID 

- the nucleotide sequence detined by 

»0:62 or SEQ ID •;° = "; „ SEO ID »0:62 or 

- sequences complementary to s.™ 

,5 SEQ ID H0:89; particular homologous to 

. sequences equivalent, and in P 

SEO ID H0=«, SEO ID »0:« "/^^^ -j;',,,,^,, sequence 

defined by SEO ID J*^^ ^ of a peptide sequence 

30 - sequences -^-J ,„„,,„,ous to SEQ ID »0 = 3,, SEQ 

equivalent, in part.c ^^^^^ ^^^^^^ ^^„, 

""'"/L s«a of patients infected with the MSHV-1 
Turgor in wh:: - HSKVI virus has been reactivated. 
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The invention also relates to a nucleotide 
fragment (called fragment I) having at least one of the 
following definitions : 

- a nucltiae .ra,„ent comprising a nucleoUae sequence 
5 selected fro. the grcup Including SEQ ID H0.», 
SEO ID »0 = 94, SEQ ID »0:169, SEO ID »O:170, SEQ ID H0 = 171, 
SEQ ID»0=I7 , SEQIDHO:m, SEQ ID HO = 177. 

SES lu > 4.helr complementary 

SEQ ID N0I17B, SEQ ID N0:17<>, their comp 

sequences and their e<r.ivalent sequences, in P-^^^^l^^ 
10 nucleotide sequences displaying, for any succession of 100 
contiguous monomers, at least 50% and preferably at least 
homology with said sequences and their complementary 
sequences, said group excluding SEQ ID N0:1, 
said nucleotide fragment not comprising nor consisting of 
15 the sequence HSERV-9 (or ERV-9,, preferably the nucleotide 
!^que„ce of said fragment is selected from the group 
i Zing SEQIO»0..«. SKQIDHO:,.. - ' 

SEQIDHO=170, SEQ ID »0:171, 

SEQ ID »0,176. =KQ ID »0:177, SEQ " 

20 SEQ ID no: 179, their complementary sequences and their 
Equivalent sequences, in particular nucleotide sequences 
displaying, for any succession of 100 contiguous monomers, 
at least 70% end preferably at least »0% ho.olo«- with 
said sequences and their complementary sequences; 
„ - a nucleotide fragment comprising a coding nucleot.ae 
sequence which Is partially or totally identical to a 
nucleotide sequence selected from the group -^^"^^"J 

cT?n TH NO •94. SEQ ID NO;lo9, 

SEQ ID NO: 93, SEQ ID NU.y^, 

xn Mn.1-71 SEQ ID NO; 172, 

SEQ ID N0:170, SEQ ID N0.171, 

30 SEQ ID N0:17e, SEQ ID HO:177, SEQ 

SEQ ID no: 179 ; their complementary sequences , their 

^ 4« narticular homologous to 
equivalent sequences, in particuxa ^^^^^ 
SEQ ID N0:93, SEQ ID NO:94, SEQ ID NO:169, SEQ ID NO. 170, 
SEQrDN0:17;, SEQIDN0:17., SEQIDN0:17S. 

35 SEQ ID NO:177, SEQ ID NO:178, SEQ ID N0:179; 
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seciuences encodin, .11 or p.rts of the peptide 

defined by SEfi ID SEQ ID SO:173, 

sequence defined „„.,„ sEQ ID NO 1 180, 

SE0IDHO = 174, SEQID»0:175, 

SEQ ID »0=181, SEQ ID HO:182,- ^ 

sequences enccdi- al o^^P^ ^^^^^^^^ ^ 

^""of tinrreco.ni.ed sera of patients infected 
. :r t J vi.us. o. m who. tne MSHV-i .as 

been reactivated. ,«T=tes to any nucleic acid 

invention „s and/or 

probe for tne ^etect.on^of v - sso 

rheu-atold arthritis (RA) , / a^,i„^ above, 

" specifically »ith a„V --^^^ id 

belonging or lying ,aeitlon, to any nucleic 

pathogenic "/^^ \ pathogenic and/or infective 

acid probe for ^-""^^^ J , ^ ..^able of hybridizing 
agent associated with RA, whi ^^^^^ ^ 

20 specifically with any ^"^^"^ especially with 

reference to the pol and gag genes 
respect to the sequences SEQ 'D » 

8, ana SEQ ID «0S 38 83 an 80. 

The invention also relates to P ^ 
„ amplification by oi^e.^^^^^^ . 

viral material, „^ equivalent to at least 

„.oleotide sconce '^J'lJ^^ any fragment 
one portion " particular a nucleotide 

such as is defined '^-O; -^J^^^^^„„ at least 10 
30 sequence displaying, contiguous Bonomera, 

contiguous monomers, preferably ^^^^ 

„ore preferably 18 -"''^""""^^Taast 70* homology 
preferably^ contiguous m.»^s, ^a ^^^^ ^^^^^^^ 

with at least tne a f primer is 

3. preferably, the "--'-J-^rnces selected from the 
identical to any one of the q 
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^7 to SEQ ID no: 50, 

group including f ^/''g^Q ^0:86, SEQ ID NO: 99 to 

SEQ ID NO:55, SEQ ID NO:64. SEQ ID N ^^^^^^^^ 
SEQlDNO:m, SEQIDK0:183. 

SEQ ID NO:185. SEQ ID ^0:186. i„,,„tion also 

'^""'''"l o/C ana in particular replication 
encoxapasses any HNA or DNA 

vector, co.pris ng a .eno^c ^^^^^ ^ 
material such as is 

fragment such as is ^-^^^^^ relates to the different 
r:;r o: rrerdC .-e .elonging to a 
peptides encoded ^"^ ^.f^^ed above, m 
nucleotide ^"^^^ ,,a„pie any oligopeptide 

particular any aeter^inant recognized 

forcing or --pr--^ - ^^^^ ,i^s and/or 

,5 by sera of patxents xnfected w ^^.^^^^^^ preferably, 

in Whom the MSRV-l vxrus has been ^^^^ 
this polypeptide is antigenxc, ..^^..^on, at 

reading frame ^^^i""^'^^' .^^ \^ nucleotide 330 of 
nucleotide 181 and endxng at 

,0 SEQ ID NO:l.^ ^^^^^^^^^ ^^^^ ^^^^^^^^3,3 following 
polypeptides : 

bv any open reading frame 

25 belonging to a nucxe 

above ; ^ in that the open reading 

. , polypeptide, ■*''"'=""^."''/" „ the 5-3' direction, 
ireme enooding it, L ""P^^'^;,,, „o4 of SEQ ID HO-.93; 
^tween nucleotide 18 and n-l-'^^; = coMprisln, e 

sequence partxaxxy 

b) v.4„«nt or synthetic, having a 

. a polypeptide, -^^^^^^rises a siguence identical or 

::rj:r:;r;rr..tic.. s.d poi.eptide 
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e=*lM« ,n en.v.«ic aCivi.v =onsiatln, o. profoXytlc 

in that t.e open r..ain, fr^ -/^renls a! nucleotide 
5.-3. direction, at nucleotide 18 and e 

3«0 of SEQ ID N0!93i .^.i^itory activity on the 

-./otr.rri.rtT.Voir:^ i acco^m. 



to b) ; 
10 c) 



polypeptide, — :r::„r;de::::! 

rrienrr:^ -o=::7r Particul. said polypeptide 
exhibits a reverse "-=-^^"-;r;u:r. „.icn co^^rises 

a sequence identical or eq ribonuclease 
particular said polypeptide exhibits 

activity; ^^^wtnant or synthetic, characterized 

. a polypeptide, ^^^^l^^l'^J^J,,,,, .eqins, in the 
20 in that the open reading fra»e e nucleotide 
5.-3' direction, at nucleotide 341 and en 

2304 of SEQ ID N^*'"' synthetic, characterized 

- ^ POIVPeP-^-- JeX; Lie encoding it begins, in the 
in that the open -^^^^ J^^^^ ^3,, ends at nucleotide 

25 5 '-3' direction, at nuoieo<.i 

2304 of SEQ ID HOI93. ,„hibitory activity on the 

- a polypeptide "-^"V ;'^;-;"7ypeptide as defined 

X'T«ron" "nu'cir: activity of a 
according to c) or ^. 
30 polypeptide as defined according to o,. _^ ^^^^^^^ 

in particular, the in patients 
anti,enic polypeptide -"^"^^^^ ^tVlhc «.! HSBV-1 
infected «ith the «SPV-1^-^ ,3 

Virus has been reactiva ' equivalent to the 

partially or totally iaehtica or is^ , 
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SEQIDHO=180, «>«""°-VJ^ ,,„p.e, « any sequence 
sequence 1. including t^he eequences 

SEQ ID no: 41 to 

SEQ ID no: 87. invention also proposes mono- or 

The present mventx 

above. relates to: 

The invent.cn ^3^,. cr cf an 

15 - reagents £cr detectxcn cf th ^^^^^ 

exposure tc the lat.er, -f ^^^^^^^isting c£ a prche ct 
.ubstance selected £rc. the grc P ^^^^ particular an 

the present invention ^ J'°^''^J,„,, ,,„ve, cr an ahti- 
antigenic peptide, such as s ^^^^ pclypeptide, 

„ llgand, in particular - ,,,,.peutic ccpositicns 

" . .11 diagnostic, P^P"""""" particular antigenic 

^p.i.ing one or -"J^^n^,,, one cr .ore anti- 
peptldes, such as are ^^'^"^ tc the peptides, 

" trorei:.^- a -.-%tr:et::s to a„v diagnostic 
The invention also particular for 

■ prophylactic or therapeutic c»pc-t-^^^ ^^^^ „,^,.t^ 
Uhiting the expres"on o^ at 1 ^^.^ the 
,„ vith MS or B.. and/cr ^ nucleotide fragment 

proteins of said virus, J polynucleotide, 
'such as is aef med J«« J ^ ,,^s„ce is partially 
particular fragment, except for that of 

. -I ^-hat of the sai« i.^- -tin nO:1- 

identical to tnaT^ " ,«„4.ide sequence SEQ 

3, .»g.ent -^,r:ort-^^^^^ - - 

Likewise, xt, re 



'^K' PCT/1^7/01482 

WO 98/23755 



17 



10 



therapeutic composition, in particular for inhibiting the 
expression of at least one pathogenic and/or infective 
agent associated with RA, comprising a nucleotide fragment 
such as is defined above by reference to the pol and gag 
genes, and especially with respect to the sequences 
SEQ ID NOS 40, 51, 56, 59, 60, 61, 62 and 89. 

According to the invention, these same fragments 
or polynucleotides, in particular oligonucleotides, may 
participate in all suitable compositions for detecting, 
according to any suitable process or method, a patho- 
logical and/or infective agent associated with MS and with 
RA, respectively, in a biological sample. In such a 
process, an RNA and/or a DNA presumed to belong or 
originating from the said pathological and/or infective 
15 agent, and/or their complementary RNA and/or DNA, is/are 
brought into contact with such a composition. 

The present invention also relates to any 
process for detecting the presence or exposure to such a 
pathological and/or infective agent, in a biological 
sample, by bringing this sample into contact wxth a 
peptide, in particular an antigenic peptide such as is 
defined above, or an anti-ligand, in particular an anti- 
body to this peptide, such as is defined above. 

in practice, and for example, a device for 
detection of the MSRV-1 virus comprises a reagent such as 
is defined above, supported by a solid support which is 
immunologically compatible with the reagent, and a means 
for bringing the biological sample, for example a sample 
of blood or of cerebrospinal fluid, likely to contain 
anti-MSRV-1 antibodies, into contact with this reagent 
under conditions permitting a possible immunological 
reaction, the foregoing items being accompanied by means 
for detecting the immune complex formed with this reagent. 

Lastly, the invention also relates to the detec- 
tion of anti-MSRV-1 antibodies in a biological sample, for 
example a sample of blood or of cerebrospinal fluid, 
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,,,, , .e„en. " ^^^r'^li^in. their possible 

i,«>noXoc,ic.l reaction, and t p ^^^^ ^.^^^i. 

5 complex thereby forsed v.th tnej-9 ^^^ 

Before ^^="^'7 Jl^ion end the =lai« are 
different terms used in the desorip 

def ined..^^^^^^ ^^^^^^ ^rs^^ - 

.0 infe.tive -^re^ aX^a^eria and.or para- 

i„,, example, ,„tigenlc power, 

sites, generating as an ex«.ple, a 

harboured by a culture or ^,ji„ition can contain 

viral strain according to the ti^t, 
. colnfectlve agent, .or ex^P e a P g^^ 

. the term "MSRV infective agent 

---- rTs Tn ;::r.arr^^^^^ — • re 

associated w.th MS, m P ^.^^^ ^^^^.^^ ^^e 

attenuated strains of the ^^^^ieles containing 
20 defective- 

interfering pa^- ^.^ernatively genomes 

coencapsidated g^""'";^' „sRV-l genoiae, derived 
recoBbined with a P--;-j;^;\„, especially viruses 

from this species. Vir ^^^^ ^ ^^^iability resulting, 

containing RNA. are ^^^es of spontaneous 

» - ^h^iU " in »ind below for 

mutation (7), ^"-^ ^ n^„i-*> 

defining the notion of egu-lence, ^^^^ ^ 

- or of being harboured by hu,»n 

capable of infecting, 

30 being., ^ natural or induced vari- 

. in view of encountered when 

ations and/or "-"""t"';: :irtl subjects of the 
implementing the P"""' """^" ^he claims, have been 
latter, defined above " aerivatives of the 

35 expressed including the eguivalen ^^^^^^ 
different biological materia 
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particular o£ the ho.o.o,ous nucleott.. or p.ptiae 
.«^.„c... ^ „^ . pathc,e.ic 

to the invention 

5 comprises at least on corresponding 
a„.,.oay : , .aia patncenic a„a,or 

antigen ot part of which Is 

infective .,e„t. ana/or a ^^^^^^^^^^^^ at 
detected by at leasi: on j onecific for the 

least one nucleotiae -^^".^-"""J' : ,tte U. such 
.aia Virus ana^or ^^-^ -1^ the p^i^ers ana 

°" '"'^ ^aCIn^' a nucleotlae se,^ence chosen fro. 
probes having a » „ no=16 

SEQ ID KO=20 to SEQ ID "O-"' = « g^Q ID NO=33, 

— r,"- -tirr h,:^:il.ation conaltlon. 

— to a P"--""tl '"th? tnltlon, a nucleotlae 
„ - acccramg « polynucleotiae is an 

fragment or an "^^--^^VtiopIlyL' characterizea by 
arrangement ot ,«.no»ers or a bxop y ^^^^^ 

the inforBatlonal sequence ot the nucleotide 
Which is capable ot ''VbrlaUin, With any o^^^^^^^^^ ^^^^^^ 

„ fragment under ^^^'1'^^^^^:^^^:^.^ of altterent 
£„r the " °n,talnea tro. a molecule o, 

chemical structures and to ^ ..combination 

natural nucleic '"^ ^.Jc.i^^ 

ana/or by chemical synthesis, nu ^^^^^ 

" — - mZrh/L particular a gene 

:r:rs viius :o: "Lple poI or env in the case ot the 

said virus, „ h« « natural nucleotide of 

- ' "irtutt 1 . nts are a sugar, a 

3S nucleic acid whose oor.t^^«^^^ the sugar 

phosphate group and a niT:r g 
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is ribose, in DNA the sugar is 2-deoxyribose; depending on 
whether the nucleic acid is DNA or RNA, the nitrogenous 
base is chosen from adenine, guanine, uracil, cytosine and 
thymine; or the nucleotide can be modified in at least one 

5 of the three constituent elements; as an example, the 
modification can occur in the bases, generating modified 
bases such as inosine, 5-methyldeoxycytidine, 
deoxyur idine , 5- (dimethylamino) deoxyuridine, 2 , 6- 

diaminopurine, 5-bromodeoxyuridine and any other modified 

10 base promoting hybridization; in the sugar, the 
modification can consist of the replacement of at least 
one deoxyribose by a polyamide (8), and in the phosphate 
group, the modification can consist of its replacement by 
esters chosen, in particular, from diphosphate, alkyl- and 

15 arylphosphonate and phosphorothioate esters, 

- "informational sequence" is understood to mean 
any ordered succession of monomers whose chemical nature 
and order in a reference direction constitute or otherwise 
an item of functional information of the same quality as 

20 that of the natural nucleic acids, 

- hybridization is understood to mean the 
process during which, under suitable working conditions, 
two nucleotide fragments having sufficiently complementary 
sequences pair to form a complex structure, in particular 

25 double or triple, preferably in the form of a helix, 

- a probe comprises a nucleotide fragment syn- 
thesized chemically or obtained by digestion or enzymatic 
cleavage of- a longer nucleotide fragment, comprising at 
least six monomers, advantageously from 10 to 1000 mono- 

30 mers, preferably 10 to 30 monomers and more preferably 18 
to 30, and possessing a specificity of hybridization under 
particular conditions; preferably, a probe possessing 
fewer than 10 monomers, but preferably fewer than 15 
monomers is not used alone, but is used in the presence of 

35 other probes of equally short size or otherwise; under 
certain special conditions, it may be useful to use probes 
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..«r than 100 monomers, a prob. may be used, in 

r-o^Uc ....o ucn .o.ecu.es be.n„ 

particular, detection probes, 

^ hv anv suitable means, that xs 
5 solid support by any ^^^^^.^ covalent bonding or 

directly or indirectly, for example by 

passive adsorption, labelled by means 

. the detection probe -y ,,,,,,etive 

Of a label chosen, xn P^^^^^^^^^^^^ .oxidase 
10 isotopes, enzymes ^ hydroiysing 

..d alKaUne Phospha^^^^^^^^^^^^^^ luminescent substrate, 

, cnromcgenic, fluorogenic 

chromophorio chemical comp analogues and 

or luminescent compounds, nucleotide 

15 biotin, diagnostic purposes of the 

- ^""^^ 1 al\ Known nybrWUaticn 

invention .aV be^ ^ L tbe techni^es te^a "DOT- 

techniques, and in pa nMoRTHERD BLOT", »hlch le 

. techni.^e " sAHDWICH technique (U) ; 

Which usee «HA « ,,,nnic,ue is used in the 
advantageously, the ,,,, ^pture probe 

present invention, ^^f"""' the understandln, 

and/or a specific ^-"ction probe, ^on^^^^^_^ ^^^^ ^^^^ 

„ that the -P"/\^";;,'%,^i,uy different nucleotide 

possess an at 

sequence, ^4„„ t-o the present invention 

. any probe J, ,„a/or with DHX 

can hybridi.e in vivo v troj^ 

" ^rtirar '—n and.r transcription. and,or to 
degrade the said 0» and/or ^^^^^ 

. a primer is ^.ononers, and 

monomers, and — ^/^^.f "^Lrs, possessing a 
" rectnci; rnvbr^di^ation under particular conditions 
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of an enzymatic polymerization, for 
.or the i-^-^^°"°^,,^;,,,on technique such as PCR 
example in an ^^^^^^f/^f^^ elongation process such 
.polymerase chain transcription or the 

as sequencing, m a metno« 

- two nucleotia. or pep.iae "^^^^J^^r^ 
e^,v,Xe„. or aerive. wit. -P- " one anotKe. 
„ape=t « a reference ^^^--^ 3U.stantiaUy t>.e 
eorresponaln, "-^"^^^ItJ'Untical, as r.,ara= t« 
.0 sase role. ^^^^ ^J^^^^,,,, or in the techni^e xn 
application or use in c, particular, 
„hich they participate, t« sja ^ ^^^^^^ 

ec^ivalent If they ;^";:„,aneous mutation o£ the 

variahility, in icLentitied, or induced 

.pedes .ro. ^-^^J^l^ZlZo^s ee^ences. ho.olo«r 
variability, as are 

. ---- " p^- 

.pohtaneou. or Induced .l^ratdT^' deletion 

, /icular hy substitution ^^^L tra^ents, and/or 
of nucleotides and/or of n 

extension and/or """^^-"^ °' "^^^ £rom the genetic 

ends, an unnatural var-b^U'V - ^^^^^ ^^oice of 

engineerln, j:!^^ or otherwise, selected for 

,5 synthesis pri-ers, dagener ^^^,,^^11^, can manifest 

amplifying a nucleic aci , parting sequence, 

l«elf in oapable of being expressed by 

considered as reference, and capa ^^.^ ^^^^^^^^ 

a degree of homology relative 
30 sequence, „„„acterizes the degree of identity 

- """"^"^ fragments compar«l, it is 

of two nucleotide or P'^"''; J is determined, 

measured by the percentage ide«^y^^ ^^^^^^^^^^ 
in particular, by direct P ^^^^^^ „„oleotida or 
35 peptide sequences, relative 
peptide sequences. 
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irtantitv has been specifically 
. this ^t^'%,a^e„ts, Clones In 

astarmme. for the Lention, which are 

particular, dealt with xn the P"-" MSRV-1 
Loiccu. to the «ra,»ents identx ^^ ^^^ 

virus, by SEQ 10 "O-^!" '^-;; =3^', „ »0 = 56, SEQ 10 H0=57 
to SEQ 10 H0,«. SEQ ID »O.40 S Q ^^^^^ 

and SEQ lo »0>93, " J° i„ers identified hy SEQ ID 
.„.olo,ous to the P"^=^V « = », SEQ ID »0=16 to SEQ 
«0:20 to SEQ IB HO=24, ^ id »0=«, SEQ 

, XP »O=10, SEQ 10 «0=3i SEQ 10 »0:50, SEQ 10 

IB BO-.47, SEQ 10 -O'"' 1. »0:S7 and SEQ 10 

K0..5S, '^""■Z'" Z example, the .»U.«t 

^0,„ to SEQ lO ^■■'^'■Jl^^^^r, the different general 
percentage identity observed het- 

Consensus sequences of ^^"^^ the U,7PC 

'"^r. tr^^rX tTa protocol detailed later. iS 
Z "the region descried in Ei^ t 
- any nucleotide y possesses a 

„ or derived fro. a ""-^ f ^ the =e^-- 

" nucleotide s«,uenoe definition, the 

„£ere„ce fragment, accord.J^ ".^..lent to a reference 

tolloving in particular are 

. . .tf^arrmf^nt:: . ^'-^^^ a-t- least 



nucleotide fragment! hybridizing at least 

a) any fragment oapabl segment. 

partially with the "^^'^^^I'l ^,,,„„e„t with the refer- 
b, any fragment whose ' J . larger 

ence fragment ^ ^^^^ tses than with any other 

number of identical ,„,n„„ic group, 

30 fragment originating from „.pahle cf result- 

c) any fragment "^-^ J species from 
in,, from the natural variability 

„hich it is obtained. resulting from the 

d) any fragment =»P"' the reference 
3. genetic engineering tecm,l^es applied 
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. A«« at least eight 

c„.tl,u».s — -^f^, peptide encoae. .y 

ho-ologous or identical t 

reterenc. £r.g>..nt, different from the 

'1 -V "'^^r deletion or substitution 
reference fragment by -"^^^l^^^,,, .hortenin, at on. 

„£ at least one monomer, or ^^^^^^^^ „.^ent 

„ both of Its ' ,„^ent flanKed at one or 

:::roriren:: r.=- — - ^ 
— ^-.oiypeptide --rrinrars; fn :rtr- 

lar, any peptide of at ""'^^ £or example an 

an Oligopeptide, ^ f :;"3„,3t.ntially isolated or 
enzyme, retracted, separated - particular 
-""^"ratnerr — -thesis or by egression in a 

those obtaineu 

„oombinant organism, ^^^^^^d by a nucleotide 

- polypeptide """'''^ iyp,ptide possessing at 
fragment is understood -'"JJ^/^; .east nine 

.east three amino ac^= ^ ^^^^ „„„,.„,,,a 

contiguous monomers lying 

fragment, analogous to another 

. an amino acid is t i^ochemical prop- 

,S amino acid when J^^^^^ Jly and,or basicity 
erties, such as P°l»"«' eubstantially the 

a„d/or acidity and/or -^"^'J^^ ^„ isoleucine. 
same; thus, a leucine - ,,,„ad eguivalent or 

- an, polypeptide is polypeptides 

30 derived from a "-""-^^"^r aete properties, and in 
compared have ->'-«""^^'^ Itigenlc immunological, 
particular the ,„%ao!gnition properties, the 

.,=.1 and/or molecular ^ ^ 4.^ » reference 
en^ymological '-^^ ..^Ivalent to 

following m P^^"* 
35 polypep^i*^®' 
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.„y p-vpepu- j--"^- 

analogous ai.lno aold, equivalent peptide 

any P^^^^-'^f;^,";"; induced variation o* the 
.e,uence. *-^-'> J^'' Xude and/o. of the nucleotide 
said reference ' ^lypeptide, , 

fragment codin, for the ^^^^/^^erence polypeptide, 
c, a BlBotope o£ the e ^^^e one or more 

a, any polypeptide - ^ ^ ,„ amino acid 
•d Of the L series are repiac 
TX:l\:ri^. and vice vers. ^^^^^^^^ , 

e) any polypeptide into ^^^^^ 

cation of the side <=-;-j; ^ an acetyl.tion of the 
introduced, such «-J- J,,f„„ the thiol functions, 
15 amine functions, a ^^„boxyl functions, 

carba, reT-t"» 

^« lonoxv bonds, least one antigen 

- """''^ C,. any P-VPeptf :t J ^^^^^^ ,^.,„., . 

Which i. reccnxzed by 

.ef erence P--^ _ rs^^ .ptr^r 
- — -ri:::-n.'Tirst%o: .nd Preferah. at 

to the present inve 

vie. Of the f^. r;:::::- 

.everse transcriptase en.ymat act 

3„ cany — -r":" - - rrs p :=er- 

ing such re aescription. ^^^^^ 

according to the pr ^^^^ m the P 

The expressions of nucleotide 
• .1 and the claims, such as 
description ana 
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sequence", are not adopted so as to express a particular 
order but so as to define the invention more clearly. 

Detection of a substance or agent is understood 
below to mean both an identification and a quantification, 
separation or isolation, of the said substance or 



or a 



said agent. n k-. 

A better understanding of the invention will be 

gained on reading the detailed description which follows, 

prepared with reference to the attached figures, in which: 

_ Figure 1 shows general consensus sequences of 

nucleic acids of the MSRV-IB clones amplified by the PGR 

technique in the "pol" region defined by Shih (12). from 

viral DNA originating from the LM7PC and PLI-2 lines, and 

identified under the references SEQ ID NO: 3, SEQ ID NO: 4, 

15 SEQ ID no: 5 and SEQ ID NO: 6, and the common consensus with 

amplification primers bearing the reference SEQ ID N0:7; 

- Figure 2 gives the definition of a functional 
reading frame for each MSRV-IB/ "PGR pol" type family, the 
said families A to D being defined, respectively, by the 

20 nucleotide sequences SEQ ID N0:3, SEQ ID N0:4, SEQ ID N0:5 
and SEQ ID NO: 6 described in Figure l; 

- Figure 3 gives an example of consensus of the 
MSRV-2B sequences, identified by SEQ ID N0:11; 

- Figure 4 is a representation of the reverse 
25 transcriptase (RT) activity in dpm (disintegrations per 

minute) in the sucrose fractions taken from a purification 
gradient of the virions produced by the B lymphocytes in 
culture from a patient suffering from MS; 

- Figure 5 gives, under the same experimental 
30 conditions as in Figure 4 , the assay of the reverse 

transcriptase activity in the culture of a B lymphocyte 
line obtained from a control free from MS; 

- Figure 6 shows the nucleotide sequence of the 

clone PSJ17 (SEQ ID N0:9) ; 
35 _ Figure 7 shows the nucleotide sequence SEQ ID 

NO: 8 of the clone designated M003-P004; 
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^h. nucleotide sequence SEQ ID 
. Figure 8 shows the nucl ^^^^^^^ 

o of the clone Fll-l; the P°^^^°" corresponds to a 
no: 2 of the p3,,«er corr P 

two arrows m the y .^^^ „hich wa 

variability i.posed by the cho ^^^^ ^^^^ ^he 
the cloning of Fll i» 



for the cloning - 

translation into -^^^^f J nucleotide sequence SEQ ID 
_ Figure 9 shows the n ^^^^ 

. n^res 10 "Ji^ „i„aviolet Ug" of an 

i„ the f=« °^ » r»ro=e gel. of the a.pUfi- 

ethidiuT. VrcBiOe-i-P'^-^'"* '^"^^ identltied by 

:::io„ p.oau=t= rHO,lS ana SK« 10 »0 = », 

^ ^ . Figure 12 g^ves ' ^^.^ <,f „sBV-l and 

„. the ho.olo«, ,.3i,natea HSERV,, this 

«,.t Of an -^"''-resris demonstrated by a 
hcology of at 1-=^=-^ ^^ a ho»olo«- of less 

,0 line, the absence of a 1 

. ngure 13 Shows the nucleotide seguence SEQ 
= " °^ n:^:: inho»s tne se,uence neology between 

3„ ^^^^^^^^ -ogy between 

_ Figure 18 snww 

TTRcl and the clone FBd3; v,«moloav between 

the clone JLBcl an sequence homology 

- and Figure 19 ^ 

TTBe2 and the clone FBd3 ; ^„„r^^oav between 
«,e clone JMc2^«^ ^^^^ 3„„ence homology 

th. Clones JLBcl 
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standard unrelated to the "-Ij''-' "/^^^^^ 3^ „ 

«0=» t.. clone .B., end three potential readin, «a.es 

- - " :r::o: rrrciu se,.ence . 

.0 = «» and a potential reading «a»e o. SBO XO HO,»a 

ter». o. -J-;^;-- _ Ho»olo,y «twe.n 

^ HSERV-9 retrovirus; according to 

olone FBdl3 and the HSBKV 
the clone continuous line means a 

this representation, the 

percentage homology greater than or equal to / 
percentag percentage homology; 

absence of_a ^^^^^ SEQ ID 

ho: 61 Of the clone FPe and three potential reading frames 

Of SEQ ID N0:61 in terms of ^^^^^ ^ ^ 

- Figure 39 shows the nucleouiae s^h 
H0:S9 Of the Clone O.E.A and three potential reading 

frames of SEQ ID NO:89 in terms of amino acids; 
frames ^ Q ^^^^^ ^ ^^^^.^^ ^^^^^ the 

5 region E and coding for an MSHV-l retroviral protease 

identified by SEQ ID NO: 90; 

- Figure 41 shows the response of each serum 

*=v.«™ MS indicated by the symbol (+) , 

patients sufter.n, fro,, /^ueed by (-) , tested with an 
and of healthy pat.ents, ^ 

JO anti-IgG antibody, expressed as net op 

- Figure « Shows the response of each seru. of 

*v.«« MS indicated by the symbols (+) 
patients suffering from MS, indi 

«nd roS) and of healthy patients (-) , tested witn 
and (QS) , an optical density at 

35 anti-igM antibody, expressed as net op 

492 nn; 



20 

NO: 89 
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the RT-activity profile in 
_ Pigure 43 ^;";,,,,ts fro. B-cell lines 

j.n.ltv gradients of pei „„,,„ed £rom the 

sucrose denmity 9 , obtained 

3upern.tants, """"V" ^^^^ „itochondrlop.thy . MS B-ceU 
relative of a P="-"^ '..tient with definite «S, 
5 line D wee obtained from a P=*"; ,„a amino acxd 

' . Figure 44 show. detected 

.Ugnment of the — L .Pan-retrcvirus" PC«. 
in the study (of ^^^l^J\,\.sr... and standard singXe- 
.oeletlons" are designate amino acids and 

,0 ietter abbreviations '" --/J^J, J,,,, conserved V.PQO 
nucleotides (i - ^^^.p^.te bloCs in bold type 

and YXDD regions are " ^.'^^ ,,,,3 which are present 

at the end of each sequence »ml underlined, 
in all or in all but one of the ^ ^^^.^^ 
" - -nn/rfTsenr wli^els primer ..-01 

orientated 5' to 3 l ^^^^ above (PAN u 

(antisense,. -^--'.^ ^ the primer seguences. 

P»».DI) or below (P^«> ^^^,^„ cttggatcc. J 

„I., denotes the nine ctcaagctt . The capture 

,„ d'enotes the nine base e*e-l ^„ ,^s. assay 

and detector probes ^^^ "^J,,, „sRV-cpoI seguence. At 
are shown below a """^f^^.^.iated HSRV-opol sequence 
..ree positions ^^^/^^ JJ^n^ing -non-silent- nucleic 
alternative amino acids <repr ^^^^^^^ . ^ a„d Y 

„ acid variations) are show" <,i„„es 
substitutions were only observe ^^^^^„t proportion 

Tereas « and w ^^f'^, ^.-vatlon. Hots that ^Vl 

cf the Clones l"-=P^f ° cpVlb may be blotinyl.ted 
IS peroxidase labelled and that P 
30 at the end if --""^ "n, I^ed at the left of the 
name of each seguence is 

1 . HIVi: Human 
figure. . virus type 1, 

^Vl= Human LeuKaemia virus „cloney-Murine 

Lunodeficlency Virus «pe J_ „ .V. 
-.c Leukaemia Virus, 
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. , „sRV-cpol: Multlpl. Sclero.ls 

„soci.«a Retrovirus ■=°"=="' Lyiogenlo «ee which is 
. ngure 45 f<^^ ehcoded by the 

.ased on the conserved e„ao,enous and 

5 pol gene o£ MSBV .nd "' J^ ^,,,,es with reverse 

--TpterT^r Xat^ed .V — 
transcriptase, j-i- -4..,^^-. 

p.ogra. of ,xAV: Equine Infectious 

HSRV: Human ^pu^^^retr ovxrus . ^^^^^ ^,^2: 

,0 .enemia Virus. BLV: Bovxne -u^^^^^^^ ^ ^ 

Hu^an X— °^^^^^^^"°^/Xres ^YPe 1 and 2. F-Mu.V: 
HTLV2: Human Leukaemxa Vxr ^^^^^^ „oloney-Murine 

■ A Murine Leukaemxa virus. vir-us. GaLV/ 

Friend-Murine Endogenous Virus. ^ f 

Leukaemia Virus. ^AEV. B ^^^^^^ ^^^^^ Endogenous 
15 Gibbon Ape Leukaemia Vir . ^^^^ i^tracisternal A-type 
Retroviral sequence, clone . ^eRVI^iO: Human 

particle. MPMV: Mason-Pfi.er K^ey ^^^^^ ^^^^^^^ 

Endogenous ^-^'^^^^^^^^ J'^,, sequence) : Human sequence of 
Virus. HSERV9 (ERV9 database seq^^^ ^^^^.^^^ sclerosis 

20 Endogenous R^^'^^^'^^^^'' ^^^ian Immunodeficiency Virus; 
associated Retrovirus. SIV Simi ^_ ^^^^^^^^ 

rTLV-H: Reverse retrovirus; SIVi: Simian 

Simian Foamy Virus; VISNA. ^.^.^^ Retrovirus 

immunodeficiency -^o.irns H. 

,3 type 2; SMRV-H: „3RV sequence in t.e 

protease and Keve^ 

gene. . ^3 aligned under the 

,,e aminoacid tra"slff ^ region 
30 corresponding ""'^^t^^^f oRF cloned in a recombinant 
corresponding to the ^-^-^^^^J ^,,ed. The regions 

vector and ^3 fragments amplified on plasma 

corresponding to the A and Bj-^^^^^^^, .rackets.The 
samples from MS pat-nts a ^^^^ ^ ^^^^ 

:::::%'rror^^^^ - high, conserved ammoacids 
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and/or active sites of enzyme activities of both PRT and 
RT (including RNH) are shown underlined. 

- Figure 47A illustrates the pecific detection 
of MSRV-pol RNA sequence by RT-PCR in the sucrose density 
5 fraction associated with RT-activity and in MS plasma ; 
Figure 47B shows the RT-activity profile on a sucrose 
density gradient obtained with extracellular virion 
pelleted from an MS choro id-plexus culture. The photograph 
below shows an agarose gel loaded with PGR products 

10 amplified from round 1 (STl.l) RT-PCR products with the 
ST1.2 primer set. From left to right: water control 1 from 
RT-PCR step with STl.l set; water control 2 amplified from 
water control 1 with STl.2 nested primers; Molecular 
weight markers; Fraction n»l to 10 corresponding to the 

15 RT-activity profile shown above; Plasma samples CI and C2 
from healthy blood donors. Plasma samples MSI and MS2 from 

two MS patients. 

- Figure 48 shows an example of a variant and/or 
recombined sequence in the region of the pol gene defined 

20 by homology with the overlapping regions described in 
Figure 25, as GM3 , MSRV-1 pol*, t pol and FBd3 . 

- Figure 49 shows the nucleotide (Figure 49A) 
and amino acid (Figure 49B) alignments of the pol region 
between clones 1, 5 and 8 of the same patient (Experiment 

25 46-7). 

- Figure 50 shows the nucleotide (Figure 50A) 
and amino acid (Figure 50B) alignments of the pol region 
between clones 41, 43 and 42 of the same patient 

(Experiment 68-1) . 

30 _ Figure 51 shows the nucleotide (Figure 51A) 

and amino acid (Figure 51B) alignments of the pol region 
between the consensus sequence (SEQ ID NO: 176) of clones 
1, 5 and 8 of the same patient (Experiment 46-7) and 
SEQ ID NO:l, and between their corresponding peptide 

35 sequences. 
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and a^ino add (Figure 5») al , ^^^^ 

between tne consensus „,,eri.en. 6S-., an. 

5 SEQ ID »0:l, and 

sequences. nucleotide (Figure 53M 

- "^^r " ,u,n»ents of the pol region 

„a amino acid (f^^"" ""^J^' id »C= ive, of clones 

bet»een the consensus ' ^^p^^i^ent 46-7) and the 

iO X, . and . Of - rie,>'of Clones «, « and 

rrrftr description) sno. 

Table 5 (at f ' J";^.,,^ „ith degenerate pol 
«,e sequences <'«"f'' .^J „„i,« fractions =ontaxn.n, 

" --"Tonracrx i" — ---^ 

the peak of RT a 

Example 18) ; and description) shows 

Table 6 (at the end aetection by 

i^nical data and results of Mb ^^^^ 
the clxnical a ^gcific ELOSA assay, on ^ 

*.^«n PGR With speciJ-j. 

" :r;:roira.-ts(cf..a»pieis>. 

„ ^ ^ C01«FEC.1VB »0m« J or «TROV»a8» <« 

.... the technique 
^ ^ technique derived -:c»J^^^^ ^^,,,,3 

30 published W Shih •';;'::te removed hy treating .11 

tu trace Of con«-nant ^.^^^^^ ^^^^ ^'^.Z.lJ^ 
^' components ot of dx 
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all trace ^ed.UTn ^,^^^„t 

the components of t ^he use of dxff 

concomitantly maKes P^^^^^ .recessive series of PCR 

,.t overlapping P--- - ^ ....ces of amplxfy- 

amplification cycles ^ xn ^^^^^^ 
ing a cDNA synthesized 
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In the sample by 

, ^„ ou«.t and further reduced 
small at the ou ^^^^^ „„ the ^^^^^^ 

spurious a^^n " ^„„,,,,„„s ot a^xv.ty^^ ^^^^^^^ 

„hich enahle all "a of the P» 

, hefore ina=t-t-n o ,,,s varx-^ ^ ^^^^ 

''^ "^'^'"^ IscrlL Shlh '"'.r:\rfractions of 

""""''"ted fr- '„ a -crose gradient 

synthesized ' p„rllied <"> ' ^ „ perron d^) 

infective \\,nni,ue ^-="'-:,„7„o", " educed hy 

line (^»CC NO. «"""'j3";,oBie, produced by 
the PU-2 l"-" \ (ECACC NO. V93Ui 

,.c. the MS7P. -o^»- ^ on the 

1»7PC ixne ^^^^^^^ -'=«^^"\," Applications 

-ThPse cultures v/ere patent apf 

" r:: ts oisi and »o «/.oi«. ^^^^ 

p„.Ushsd under «os^«^^^ --rar"^.-- ^ ^c 

w TA Cloning Kitv^ automatic 

technique w th the T ^^^^ ^^^^^,,^3 ^I'^J'^^,^ the 
,0 sequence us.n, - PP^^_^^^^ ^^^^ ve»^» °' 

— ^r.^^- -.:'t:"to%ype= ^ 
» .r:rr:r;ypror::::n- - rn::^"^^ 

the POL-2 isolates msVPG" isolates of 

^^iainating f^®™ ^ ^.^ a family 

°'°"%sr corresponds " ;.„„,nt «o., the 

30 cultures), « to, „r HSERV-9, 

sequences =1°=*''' „ovirus designated „as to 

endcenous sequence vhich jorr- P^^_^^^__^^ 

^ ^^°Cy """""llTor P^-noqenic aqent 

sequences very i„(ectlve and/or P 
35 attributed to another 
designated BSRV-2. 
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the 
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«f seauence, representing 
The first type of sequ ,^ ^^^^^^^es whose 

.a^ority of the clones, J^^^^^^ .^..ences to be 

variability enables ently similar to one 

another for it to be P retrovirus, as is 

,uasi-species -^^-^^ ^^^I^^^^ (l.) . or to be the 

well Known for the HIV 1 ^^^ogenous provxruses 

outcome of ^"^-'^^^"""'^^"'tJ Llls. These xaore or less 
coreg^lated in the produc-. cell ^^^^^^^^ ^^^^ 

defective endogenous a replicative 

regulatory to the sa.e family of 

provirus, since they belong ^^^^^^ 
endogenous retroviruses ^^^^ ' ^^^^^^^^^.ely this new 
endogenous retroviruses, o ^^^^tion of quasx- 

,5 retroviral species and which contains a 

— ^-rt:e"e: a^^^ - -^-^^^ 

consensus ot tne » m 

»SKV-1B- „„s,„ts the general consensus 

«,ur. 1 '"^'T' t the different MSRV-IB clones 

„ sequences oe tne ---« ,,ese ^^^'^'^^^^^Z 

sequenced In thxs exp „ sEQ 

identified, respectively, hy SEQ^ ^^^^^^ ^^^^^^^ , 

IB »0=5 end SEQ ID J^l^,^ „n,in, froB 70% to 

roBOlo^ with respect -^Z.lT«^e. X571,7 and H3,e3» 
„ 88% with the HSERV9 --^"^""/^^^ ..eonsensus" nucleic ac.d 
" ,ne oenebenK. date hase Four p,,i„ of a 

ee^ences representative of dif ^^„,„„^ 
possibly excenous r.trov.rus 

subfamilies of an -^"^^^^/^.r^^onsensus se.p.ences are 
,0 defined, ^'se repr-n-t.ve .„no 
presented in „a.e exists for each 

A functional reading seen 

ra»UV c^ -^rrdZTa-e corresponds In ea^h 

- -t.nal r r^l .e 

" trrUh: ulr the nudelc acid sequence. ^ 
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consensus o£ th. '^Hec^Ce 1" ^^''^^o" 

H0:7 ana obtained BY this Pa> 

region, ie presented in Fijur. 1^ „presentln, the 

The second tYPe °t „p„.ent«l by the 

. .aioritv o. the -ones se^nenc- J ^ P_^^ ^^^^^^^^^ ^ 

„^.nc. MSBV-JB P-;-"^J^^ „L,„ed in the sequences 
SK ID ho: 11. The '^^""'""^ explained by the use 

co^espondin, to the PC« pri»«e a« ^^^^ 
o£ degenerate primers in »«tur 
10 technical conditions. ^^^.j, is sufflo- 

The HSRV-2B -^-■''^J^^,^,! sequences alreajiy 
lently divergent £ro. the retr ^ ^^^^ ,,,, 

described in the data ban.s fo ^ ^ 

the sequence "9^°" ^^r/r^^: infective agent would m 
15 agent, designated HSEV-2. Thi ^ ,,,et 

principle, on the ^„ , retrovirus but, in 

sequences obtained, f^^'^f ^.^aning this sequence, it 
of the technique used tor o ^^^^ ^„,^ 

could also be . VUK — "^'^ ' „,,rse transcript... 
,„ Which incidentally P^-^^^.^,,,, «ith the hepatiti. 
activity, as is the oa-.J- ^^^^ ^^^^^ ,ne 

B viru., HBV . Further^or;, .„pUfication 

regenerate primers J°%,„i„ed, as a result of 

technique may very conserved sites in 

„ unforeseen sequence ° amplification of a 

" The gene for a rel^d -V-e, ^^^^^^^^^ ^„„c 

"IndTor cotfective agent .protist, . 
pathogenic and/or c 

- » same te^.^-^ ^ptfT 

the technique of sni 
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reverse transcriptase ^..cribed by H. Perron (13). 

according to the "°''"^! "tr^ltioned in Example 1, 
, „d accoxain, to tne „,,aned by seU- 

. spontaneous lymphoblastoi ^ ^^^^^ „s 

l«ortallzatlon in -"^tive tor the Epstein-Barr virus 
patient who was — P"""';^ ^hoid cells in 

,EBV) , after setting containing a suitable 

10 culture in a suitable ^presentation of the 

concentration of =VClospcr- A ^^^^^^^ 
reverse transcriptase ^ct^^^J produced 

TaKen fro. a P-«^""°"jrtn n ure Si-ilarly. tne 
^ this line is presented in g^^^,__^_^ ^^^^^ 

15 culture supernatants of « B ^^^^ „„4er 

conditions fro» a control free f^^ ^^^^^ 



tne s.Be -""^""f '.Z^, sucrose gradient fractions 
transcriptase J" "'Unround, , and is presented 

proved negative throughout (^-^ corresponding to 

„ In Figures. ""^^ fraction without reverse 

MS B line and the ^„„„ci ^.dient were 

transcriptase °l,;^\.chni,ue as before, derived 

„.lysed by the » ^ steps of cloning and 

::::erc::g as'^escri^d^. -^^^^^^^ ... ... 

It is particularly ^^^^^ ^^^^ „ the 

„d MSBV-2 type sequences are „^,.uKe" reverse 
..terial associated «on, the MS B lympbo- 

Iranscriptase activity ""^m-t""^ £ound with 

,„ Tlastoid line. These seguenoes w«e ^ i^pnobl.stoid 

the material fro. at random. Only 

Une in - recombinant cl-s ^^^^^^^^^^^ 

« TV tvoe contaminant sequen ^^jjj,^ 

Mo-MuLV type ^viw'hase used ^oi. 

reverse transcriptase particular 
coiranercial reve ^g„„ences without any par 



wo 98/23755 



PCT/IB97/01482 



38 



of the "consensus" amplification of homologous 
result of the produced by this PGR 

polymerase sequences whxch is prod 

^«^Hr,iflue Furthermore, the absence oi a 

technique. Fu amplification reaction in 

target which J^^^^^ J^^^^ amplification of dilute 

5 the control sample permi results is manifestly 

contaminants. The difference in results 
highly significant (chi-squared, p<o.ooi). 

EXAMPLE 3: OBTAINING A CLONE PSJ17 , DEFINING A 
vx.ns TsRV-X BV KEACXION OF ENDOGENOUS REVERSE 

" Zl7Zir:.:n . virion preparation originating frok 

THE PLI-2 LINE directed towards obtaining 

This approach xs direcT^ea 

mis o^h'f ^^r.m the supposedly 

«^ transcribed DNA sequences from the supp 
reverse-transcrioeu reverse trans- 

,s retroviral m .he isolate us , the re^_^^^^^ 

..iptase activity P""« " theoretically function 

reverse transcriptase -""^^ "^^^.^^ UnKed to a 

only in the presence of a "^-"^ 

primer tBHA or hybrxdized ,,,,ieles 
,0 already -erse™^^^^^^^ n ^^^^^^^^^^ ^^^^^^^^^ 

,16). Thus, the """'"'"^ P ^^^^^^ .oids was 

in a material ^^-^^"^^^^^^^ Z..ors hy «ans o. the 
optiBizea accoraing to tn portions of viral 

•-"nhT^ir rre^;— ;re Ltivlty. .o this 

" r r a:thrUr»inea ----- vrr™ 

conditions under which this enz^tic acti V ^^^^^ 
transcription '"^^ tions correspond to th. 
effective xn ™ protocols presented helow 

technical °L "'^„' ,,oation, clonin, and 



,0 J-;; \,.otion, purification 

( endogenous R i 

se^encin,)^^ molecular approach consisted in using 



a 



The molecular approac. ^^tained 
preparation of concentrated hut unpu- ed v rio 

3, fro. the culture -P~- ^ ^ ,,,od: the culture 
according to x:ne 
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are collected twice veeKly, precentrlfu,«i at 
supernatants are con ^^^^ ^^^^^^ 

,„.„00 rp. ^° ,,,, „e for the foXlowin, 

£ro«n at so C or „^t,„t, are centriluged or 

="^^\™%r3« Jlyte^oWBS at 100.000 , ,or 30,000 rp. 
5 a cushion of 30% giycero ^ 7 h at 4»C. After 

in a type 45 T LKB-HITACHI rotor) for 2 h at 4 
in a type sedimented pellet is taKen 

removal o* the ,„„3titutes the traction 

"nLCer u Umed virion. Thie concentrated 
o£ concentrated B , ^. 

" ririnCnor rerr: tranecnption reaction, ae 

described below. purified according 

A volume or 20U toj. wj- . . ^ v-overse 

described above, and containing a reverse 
to the approximately 1-5 mllion dpm. 

.5 transcriptase ac-v^^^^^ , ^..^e appears, and then 

is thawed concentrated buffer was prepared 

placed on ice. A 5 foi Tris-HCl pH 8.2; 

the ^°\^«'::i"2acr 7rnl 'OTT and 0.10% HP 40; 100 .1 
75 itM Naci; 25 mM MgClz, 

«i of a 100 mM solution of cmif 
,0 of 5X buffer * » »1 °f „j ^ ^ solution 

°' a 100 -^>'";'"/^;„^'^' 3„,„Uon Of dCTP . 100 „1 Of 
dGTP + 25 -1 » „j ,i,ion suepenaion 

sterile distilled water * 200 »1 

,^ a^ivitv of s ""^^-J"*,;; Z, Incuhation. the 
„ incuhated at for . hou«. Af^^^^^^ ^ 

reaction mixture is a ^i^ture (Sigma ref. 

phenol,chlorofor»/ieo..,l alcoho » -^^^^^ ^^^^^ 

P 3,03), the the organic phase to 

sterile distilled water is added The 
30 re-extract the residual nucle^^_^^^^ ^^^^^^^ 

collected J^'y;,,,,i„ted hy adding 3H sodiuB 

acids contained are P v volumes of ethanol + 

acetate pH S.. to ^J-^^^^J^,^ ,01 3«, and 

1 Bl Of glycogen (Boehring ^ ^ 

3, placing .£,er centrifugation is then 

The precipitare 
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purified, cloned and ^lunt-ended DKAs with 

„ni=h wiU no« b. de,«i ^^^^^^^^^ . „,uun,- 

unpaired adenines at the en previously 
J reaction was first ^^["^^^^Z:^ a .1 of a ..5 «H 
purified D». lounts, d^TP . dCTP . 

solution pory^erase CBoehrin,er-K.nnhel« 

a„p t dCTP/l ml of T4 DNA P y buffer for 

ref. 100, 7se, '/"^: „,,,.„,„„hei. ref. 1417,7.) / 
restriction -^j;"" '"^'^ \,,u»ln solution / le ml of 
. .1 of a 1% '>°""r „i«ure was incuOated for 

sterile distilled water This ^ ^ 

ll-C. 50 ml ol -flded 



,0 .mutes at 11-c. 50 »i " „3, „ere added 

glycogen (Bo.hrln,er— ref. ^^^^^ 
thereto hefore «tra= °n "f ^^^^ ^ 



thereto before "^^^^.^^ ,si^ ref. P 3803, and 

phenol/chloroforB/rsoamyl alcohol < ' ^^.^^ 

'precipitation with ^^^--^.^^f is resuspended in 

DNA precipitated J ^ „i this suspension 

XO -1 Of 10 Tris huffer pH 7.5 5 ,t 3 ^ 

were then mixed with .0 ml of 5^ ,„pUta<,TM, and 

1 .1 °^ ^::,™ed iater. This mixture Is 

„ ml of sterile ^-^-^^^^^^ , of oil on the 

Incubated for 2 h at 75 .„aed In the aqueous 

„ surface of the solution The .»A P^^^ ^^^^^ 

solution drawn off under the fj^m , ml 

is precipitated as described ^^^^^^^^ 
of sterile distilled wate^. «.e ^ 

l„to a Plasmld usin, ^^^^^^e distilled water. 

30 solution were mixed with 5 ml o „^„, 

, „X of a 10-£old , „s n,/»l) and 

LIOATIOH BUFFER", a ml °' P ^^.^^ated 

^ »^ °' iTc The ouol, steps were carried out 

overnight at 12 C. Tn cloning^" Icit 

according to the i-truc^^-; of ^^^^^^^^^^ 
(British Biotechnology) • At the 
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White colonies of r.coBblnant bacfri. (white) were picked 
out in order to be cultured and to pernlt extraction of 
the plaemida incorporated according to the so-called 
"Biniprep" procedure (17). The plasmid preparation from 
5 each reco,*ina„t colony was cut with a suitable 
restriction enzyme and analysed on agarose ^^l- 
possessing an insert detected under »v 1^^" 
staining the gel with ethidium bromide were selected for 
sequencing of the insert, after hybridization with a 
10 pri.er complementary to the Sp6 promoter present on the 
Cloning plasmid of the TA cloningTH ^it. The reaction prior 
tl se^Lncin, was then performed according to the method 
recommended for the use of the seguencing J^-^-^ 
reaction Wt dye decxyterminator cycle 
15 (Applied Biosystems, ref. 401384), and automatic 
le^encing was carried out with an Applied Biosystem. 
rArometic Sequencer, model 3,3 A" apparatus according to 

the manufacturer's instructions. ^„,,.h data 

Discriminating analysis on the computerized data 

,n banks of the sequences cloned from the DHA fragment. 

" : "ent in the reaction mixture enabled a ""---/^ 
sequence to be revealed. The corresponding clone PSJ17 
Tmpletely — a. .e^^^^^^^^^^ msTatrsfd 

" ^ta banks. An identical sequence already ^^^^^^^ '^^^^ 
It be found by analysis of the data banks, only a partial 
not be lo retroviral elements was to be 

— hrrtTserUativ ho^^^^^ 

30 endogenous retrovirus designated ERV 9, 
according to the references (18) . 
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Five oU^cnuoleotUes H 
= .00, ana .00., ^^^rfrt.^ZZs. control reaCions 
..l,i„atin, frc» P-^'f^^^^U *or the presence o£ 
perfor^d so as "J a-pUflcatxon 
contaminant (reaction wxth » > 

„ consists oe an ^^-^^ ^'^Zn^.. .V a "nested- PC 
described in Example 2 c i^ed In the document 

accordln, to the ^^J'^^^^^ cycle, the primers 

Ep.A-0,569,272. I" ^^cond PCB cycle, 

^"-^ ""U^rir :03C .na the pnmer POO, are 
r.d"r;rrs are positioned as follows: 



15 

The priiti' 

M002-A 
M003-BCD 

" P004 P005 



MOOl 



20 



.RNA 



POL-2 

< 

pol MSRV-IB 



25 



PS J 17 

Their composition is: 



30 



Their compositxon is. 

primer M002-.= ^^^^^^^.^.^LtCT.T (SBO ID »0:32, 

,.i„er «-;-^=^;;^::i:„C.TC».^ <3.= ID »0:», 
primer P004: AACCCTTTO (jbq id HO-.24) 

primer P005= '^^'''^'='=''''^^Z,^ product obtained, and 
-nested" -P-'-^^J ,i^re ,. and 

designated H003-P004, js P 
corresponds to the sequence SEQ 
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EXAMPLE 5: AMPLIFICATION AND CLONING OP A 
PORTION OP THE MSRV-1 RETROVIRAL GENOME USING A SEQUENCE 
ALREADY IDENTIFIED, IN A SAMPLE OP VIRUS PURIFIED AT THE 
PEAK OF REVERSE TRANSCRIPTASE ACTIVITY 
5 A FOR technique derived from the technique 

published by Frohman (19) was used. The technique derived 
makes it possible, using a specific primer at the 3 end 
of the genome to be amplified, to elongate the sequence 
towards the 5' region of the genome to be analysed. This 
10 technical variant is described in the documentation of the 
firm "Clontech Laboratories Inc.", (Palo-Alto California, 
USA) supplied with its product "5 • -AmpliFINDERTM RACE 
Kit", which was used on a fraction of virion purified as 

described above. 

The specific 3' primers used in the kit protocol 
for the synthesis of the cDNA and the PCR amplification 
are, respectively, complementary to the following MSRV-1 
sequences : 

cDNArTCATCCATGTACCGAAGG (SEQ ID NO: 25) 

20 amplification : ATGGGGTTCCCAAGTTCCCT (SEQ ID NO: 26) 

The products originating from the PCR were 
obtained after purification on agarose gel according to 
conventional methods (17), and then resuspended in 10ml 
25 of distilled water. Since one of the properties of Taq 
polymerase consists in adding an adenine at the 3' end of 
each of the two DNA strands, the DNA obtained was inserted 
directly into a plasmid using the TA CloningTM kit 
(British Biotechnology) . The 2 ml of DNA solution were 
30 mixed with 5 ml of sterile distilled water, 1 ml of a 10- 
fold concentrated ligation buffer "lOX LIGATION BUFFER", 
2 ml of "PCRTM VECTOR" (25 ng/ml) and 1 ml of "TA DNA 
LIGASE". This mixture was incubated overnight at 12«C. The 
following steps were carried out according to the 
35 instructions of the TA CloningTM kit (British Bio- 
technology). At the end of the procedure, the white 
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15 



. ^ bacteria (white) were picked out 
colonies of "r^Ho P--^^ extraction of the 

in order to be cultured and to P ^^.^alled "Bini- 

prep" proo^aure (n) • ^'^'^ ^ suitable restriction 

recombinant colony was c ^^^^^^^^ possessing an 

en.y»e and analysed on «.ro« ,el. ^^^^^^ 
insert detected under OV li9« a^ ^^^^^^^^ 
ethidiu» bromide , primer ccnplementary to 

insert, ^'^^ ^X:::: Z t: oZir. Pla..ld o. the - 
the sp6 promoter P«^^" „ se<3uencing was then 

Cloning™ Kit. The f recommended for the use 

performed according 'o;- reaction .It dye 

Of the sequencing kit: (Applied Biosystems, 

.eoxytermmator 'l'^^ ^ZZTZ^^'^^^r.^ ^^^ carried out 
re£. 401^4), and aut<^at.c ^^^„^„,„ „odel 

.ith an APPiiea Biosystems M ^„„«cturer.s 
apparatus according 

i„structions^ .schnlgue was ^^^^t^ oZ^Z 

tactions Of virion P^-^t^-e^-r" --I Une on the 
,he ■.«>I.-a" ^-J^f produced by the X«7PC 

one hand, and from the «S7PG supernatants are 

line on the other hand. ™ ,^ io,000 rpm for 

oouected twice »««y. P-;«"^^;| ^,^„ at -.CC 

,0 minutes to «"-%°;",^nollowln, steps. The fresh or 
„ used as they are for ^^'J . cushion of 30% 

thawed supernatants are -ntr^'^ , type 45 T 

,lycerol-PBS at i""-""" / 'X After removal of the 

LKB-HITACHI rotor) for 2 ^^^^^^ ^„ , 3^11 

supernatant, the ^'^^-^J'^j;'^l ,„,tion of concentrated 
volume of PBS and constitutes ^^^^^^^^ ^.^^ ,^en 

but unpurlfied ™; ,^rile PBS buffer (15 to 

applied to a sucrose ^-;-";j;^„„,,„,ad at 35,000 rpm 
50* weight/weight) ana ^^^^ ^ swing-out rotor. 

» (iOCOOO g, f- ^^ ^^^ are withdrawn from 

10 fractions are 
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30 



• t PCT/IB97/01482 

WO 98/23755 



45 



10 



15 



20 



25 



30 



35 



each fraction after homogenization to assay the reverse 
transcriptase activity therein according to the technique 
described by H. Perron (3). The fractions containing the 
peak of "LM7-like" RT activity are then diluted in sterile 
PBS buffer and ultracentrifuged for one hour at 35,000 rpm 
(100,000 g) to sediment the viral particles. The pellet of 
purified virion thereby obtained is then taken up in a 
small volume of a buffer which is appropriate for the 
extraction of RNA. The cDNA synthesis reaction mentioned 
above is carried out on this RNA extracted from purified 
extracellular virion. PCR amplification according to the 
technique mentioned above enabled the clone Fl-11 to be 
obtained, whose sequence, identified by SEQ ID N0:2, is 

presented in Figure 8. 

This clone makes it possible to define, with the 
different clones previously sequenced, a region of 
considerable length (1.2 kb) representative of the "pol" 
gene of the MSRV-1 retrovirus, as presented in Figure 9. 
This sequence, designated SEQ ID NO:l, is reconstituted 
from different clones overlapping one another at their 
ends, correcting the artefacts associated with the primers 
and with the amplification or cloning techniques which 
would artificially interrupt the reading frame of the 
whole. This sequence will be identified below under the 
designation "MSRV-l pel* region". Its degree of homology 
with the HSERV-9 sequence is shown in Figure 12. 

In Figure 9, the potential reading frame with 
its translation into amino acids is presented below the 
nucleic acid sequence. 

EXAMPLE 6: DETECTION OF SPECIFIC MSRV-1 and 
MSRV-2 SEQUENCES IN DIPPBRBNT SAMPLES OF PLASMA 
ORIGINATING FROM PATIENTS SUFFERING FROM MS OR FROM 
CONTROIiS 

A PCR technique was used to detect the MSRV-1 
and MSRV-2 genomes in plasmas obtained after taking blood 
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. suffering from MS and frota non-MS 
samples fro. patients sufferxng 

controls onto EDTA. performed 

Extraction of the BNA ^ chomzynslci 

, ,.0), a«er -->*-^Jtc X -1 of plas»a stored frozen at 
guanidinlum thiocyanate to 

'"^r:^::":";- pcb „ae per.or.ea under t.e ea.e 
. the following primers: 

condition, and ^^\,,„,,,,e. SEQ ZD »0:» 

r--r^=rd:;s.-o=. 

5. GCATCCGGCAACTGCACG 3'.^^^^ ^^^^ ^^^^.^^^ ^.^^ 
However, similar resu ^^^^^.^^ ,.pxif ications 
,5 the following PCR pr.-r-; ^ ,,,,3 not treated 

by ..nested" PCR on samples 

with DNase. this first step of 

The primers used ^ . 48»C are the 

,,.,es with a hybridization temperatur 

- --owing:_ ^ ^^^^^^^ iae---Vl" ---^^^^^^ ^ 
5. MSRV-2 PCR primer, for a 

patients, -^ontified by SEQ ID NO:28 

.3- primer, identified Y ^„^i„g to a 3 • 

5. GCATCCGGCAACTGCACG 3 , corr^p ^^^^^^^ ^^^^ 

MSBV-2 PCR Pri-r, for a fxrst 

patients. ^1 of the amplification 

After this step, 1 ^^^^^ ^ second, 

30 product are talcen and ^.^h primers located 

so-called "nested" PCR ^'^^Y.^,^. This second step 
within the region already am xf.^ ,y..i.i.ation 
taxes place over 35 cycles w ^^^^^^^^ ,3 

(.-annealing") temperature of 
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■ used for this second step are the 

The primers usea toi- 

following: .^»„tified by SEQ ID NO: 29 

. 5- primer, -<*-"^^^^^^^^^^ \ corresponding to a 

- St d PCR on samples from 

5- MSRV-2 PCR primer, for a ne 

patients, 4,„.„tified by SEQ ID NO: 30 

3. MSRV-2 PCR primer, for a nest 

patients. «„niif ication was performed in 

MSRV-X, .cia sample «ea«. 

two steps. Furthermore, the nuclei 
beforehand with B»a.e .nd . con ^^^^^^^^ 
reverse transcriptase) i ,^.t the RT-PCR 

amplification steps « „srv-1 I" 

amplification co,es „,^^„,t RT, the initial 

the event of a ^^^^^^Zt treated with PHase and 
aliquot sa»ple of RNA i ■J 

ai^lified again. treatment with DNase lacking 

protocol for ^^^^^^^^ 

BMAse activity is « .^^^^ inhibitor" 

^Uguoted in the presence ^^^^^^^ , 
(Boehringer-Mannheim) m " j„ j „1 

,i™.X concentration of J^^Vinnheim, and 1.. «1 of 
n«,«e-free D»se" (Boehring ^^^^^^ ^ 

, huf far cent „i„g 0^1^M, ^^^^^^ at 

ngSOj IS added, tn ^„ , .thermocycler" . 

jo-c and brought to 96-C for • ^^^^ performed 

me first BSRV 1 .nation method as 

according to a «rl.nt "f «.X P i„ 
described in Patent ^^l^^" performed at 42-0 

particular, the cDNA 'i'"^"";; J^^^^<,„ taKes place over 
-r on. hour, the - --.^-^..^tion (-annealing., 
Tem^raru;. oTsVc. .he reaction volume is 100 ml. 
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The primers used for this first step are the 

following: 

- 5* primer, identified by SEQ ID NO: 16 
5 • AGGAGTAAGGAAACCCAACGGAC 3 ' ; 

5 - 3« primer, identified by SEQ ID NO: 17 

5 • TAAGAGTTGCACAAGTGCG 3 ' . 

After this step, lO ml of the amplification 
product are taken and used to carry out a second, so- 
called "nested" PGR amplification with primers located 
10 within the region already amplified. This second step 
takes place over 35 cycles, with a primer hybridization 
("annealing") temperature of 53 'C. The reaction volume is 
100 ml. 

The primers used for this second step are the 

15 following: 

- 5' primer, identified by SEQ ID NO: 18 
5 ' TCA6GGATAGCCCCCATCTAT 3 ' ; 

- 3' primer, identified by SEQ ID NO: 19 
5 • AACCCTTTGCCACTACATCAATTT 3 ' . 

20 Figures 10 and 11 present the results of PGR in 

the^ form of photographs under ultraviolet light of 
ethidiuro bromide- impregnated agarose gels, in which an 
electrophoresis of the PGR amplification products applied 
separately to the different wells was performed. 

25 The top photograph (Figure 10) shows the result 

of specific MSRV-2 amplification. 

Well number 8 contains a mixture of DNA 
molecular weight markers, and wells 1 to 7 represent, in 
order, the products amplified from the total RNAs of 

30 plasmas originating from 4 healthy controls free from MS 
(wells 1 to 4) and from 3 patients suffering from MS at 
different stages of the disease (wells 5 to 7) . 

In this series, MSRV-2 nucleic acid material is 
detected in the plasma of one case of MS out of the 3 

35 tested, and in none of the 4 control plasmas. Other 
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extensive series confirm these 
results obtained on more extensive 

..e bottom pboto.rapn ^^^^^^l^l^Zs^Z 
result of specific amplification by 

«eU «o. i contains ^ ^^^^^^^ 

water aione. without -^---^^ 
transcriptase; well No 2 ^^^^^.^^ ^ 

produced with water alon ^ ^^^^^^^^ ^ ^ 

, reverse transcriptase, we contain, m 

»olec.lar weight j";,! ..e .otal »B 

order. the products anpUn ,,„uected in . 

extracted «o. sucrose . pellet of vir.on 

aownwara direction) . - "^^f^J^, , ,„i,^e infected «i«> 
, originating £ro« a -^"-^"^^ e^ilibrium according 

and «as ^^„„„ ,13) , to veil » 

to the protocol described by H. ^ ^pUfied 

nothing «as applied; to we „i,i„ating £ro» 3 

-^-";Lri:ntrsrtin n/o» «S at dl^erent stages 
20 different patieniis 

of the disease were "PP""*; „<.„e is indeed to be 

B,e HSKV-1 """"^"^ „\i,„ oontalnlng the peaK 
found m the sucrose ^'^""^^ .^..ured according to 
of reverse transcriptase act. Y _ ^^^^ , 

,5 the technique described by H. ^.^lent, placed in 

" throng intensity ^'^'^^'^Jjl^T J taKen place in tbe 
«ell HO. 8). * "^oUly corresponding to BHA 

first fraction (well No- ■ ^t the surface 

released by lysed ^^^.^ \,^egated debris has 

30 Of t»e gradient; ^-^^^/^J^^^ (,„,e botto.) , carrying 
,edi-ented in the last ^^'^ ^ich have given 

with it a few copies Of t*e ^ 

rise to an amplif io«ion "'^ i„ this series, MSOT- 

ot tbe 3 P""^' ;";„,„,i„, a very intense 
„ 1 K»A turned up in one case, 
amplification (well »o. 



FCT/IB97/01482 

WO 98/23755 



50 



this series, the MSRV-1 retroviral RNA 



«robablv corresponding to particles 
genome, probably J extremely 

«v4-Taeellular virus present in rne pj-o 

extracexxuxar ..nested" RT-PCR in one case 

small numbers, was detected by nestea 
5 Of MS out of the 3 tested. Other results obtained on more 

extensive series -"^^^J^™^^^^^^^^ sequences 

Furthermore, the speciricxT:y ul ^ 
a:^Xlfled by these FOR techniques .nay be veritied and 
evaluated by the "ELOSA" techni<iue as d.scrxbed by 
10 F. Mallet (21) and tn the document _^ 
For MSRV-1, the products ot the hasted FCR 

K» *-oci-ed in two ELOSA systems 
described above may be testea in uw 

tabling a consehsus A and a consensus of MSRV-1 to 

be detected separately, ccrrespondin, to the 
15 described In Exa«ple 1 and Figures 1 and 2. In 

sequences closely r.s..blin, the consensus are to be 

fo^d essentially in the RBA samples oriqinatxn, £r^ 
™<V-1 virions purified £ro» cultures or a»pli£l«l u, 
Tracellular biological fluids of MS patients, .hereas 
20 the sequences closely re.e.blin, the consensus * are 
essentially to be found in normal hu»an cellular DMA. 

The EU.SA/MSRV-1 system for the capture and 
specific hybridisation of the PCR products of the 
Zlil. A uses a capture oligonucleotide cpVlA with an 
„ ^nd at the end and a biotinylated detection 

rigcnucleotide dpVlA having as their sequence, 

respectively: 

- cpVlA identified by SEQ ID NO: 31 
5- GATCTAGGCCACTTCTCAGGTCCAGS 3', corresponding 
30 to the ELOSA capture oligonucleotide for the products of 
MSRV 1 nested PCR performed with the primers identified by 
SEQ ID NO:ie and SEQ ID N0:17, optionally ^^^^^wed by 
SEQ i.u nw.J- i^^K.<-if iPd bv SEQ ID N0:18 

amplification with the primers identified by 

and SEQ ID no: 19 on samples from patients; 

. dpVlA identified by SEQ ID NO: 32; 
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l<jenti£i«* by SEO 

from patients. the capture and 

«>e .X^SW«SRV-1 sye^- ^ the 
specific hybridization o£ ^i„ti„,i,ted detection 

subla-lly uses «>e ^,,,,„,,ieotide cpVlB 

„U,onucleotide dpVi. and a capt^^^^_^^ 

an -ri:entined by - --^^^„,^^„,,„, « 

, ^ ^s. capture °^^^;";;^;.\"„tsted.. 

D of the products of MSW 1 „ 

primers '^--"f ^ ."cation with the primers 
optionally „d XO »0=1. on samples 

identified by SEQ ID . , be 

in from patients. system enabled it o 

«r«le* '»»^ °' !,;„s prtient. contained a se^e^e 

0»ase-treated "^^.^ were positive with the 

„t the subfamily A. 7" ^ „4 o. 

as laensus of the -«a"^^^^,,=„ technic 

.or „m infected cell cultures. 

eted on ^-^'"^"^^'"t^Uf icatlon primers, 
using the following "^."^ ^y SEQ ID HO=34 

= 1 orimer, identitieo .„„„dlng to a 



r rp.i;er, "-'"^^^^^=:^:J:sponding to 

<=nr PGR on samples J-i- 
MSRy-2 PCR prx«.er, fo ^,^.35 

. 3« prixaer, J 3 . corresponding to 

GMGGCCAGCAGSAKGTCATCCA 3 ^ ^^^^^^^^^ 
for PCR on samples it 
3. MSRV-2 PCR Prit«er, for 
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,«ture oligonucleotides with an amxne 
""^^ Tcpv" and the .iotinylated detection 
bond at the 5- end cpV2 and ^^^^.^tive sequences: 

Sde dpV2 havxng as tneir t h' 
oligonucleotide <*P^ g^Q ID NO: 36 

- cpV2 Identified oy corresponding to an 

3 oaATacccccT.T.cc-^^^^^^^ 3^^,^ corr^ J^^ 

ELOSA capture g^Q ID NO: 34 and SEQ ID 

::3r::c:-r:- ...e. .... 

by Shih (12) • JO N0:37 

. apV2 Identities by SEC I eorrespondln? to 

„ detection -^J_7;^^;;Y:e:e SB. x» »0=» and SZO 

„SRV-a PCR performed with the P ^,,,„er.te primers 

ID NO:35, or optionally 
5 defined by Shih (12) ^^.^ with a p»ir of 

This PCR .^„^„ described previ- 

,,,„ers different fro. ^^IJ^^l^^Z petients »de 

Lsly for ».P^i"-"- nf elion with «SBV-2 of in 

,„ ttjrt^r; rr o. nodeic ^id, ..d for 

the molecular biology ^^^^^ results of PGR 

^^'"'%nl^' oT plZenic and/or infective 
detection of the geno- of^^P^^^ ^^^^ ^^^^ 

agents show that it i P ^ients in an acute, 

,5 circulate in the ^1-;^ J^^J ^..^.^ system. This is 
virulent phase, ^ -^"^^^^^,7„,,,,.ble presence of "gaps" 
compatible with ^^^^ j;2r ;,%,,ients in an active phase 
in the blood-bram barrier of P 

of MS. 

O, TH. HSKV-l X-™""*" ,,,<,,,hed in Example 5, a 

« has alreadyj»aen 
PCR technique derived fro» th ^^^^^^^ ^^^^ 

"::r:ie;^irarc":cpri.eratthe3.endofthe 



wo 98/23755 



53 



... .I elongate the sequence towards 

,e„o»e to .e -P^^^^^' an^lysea. This technical 

tfc. region o£ ^'"^ aoou„.„t.tlon of -Clontech 
variant is ^'^^^'^'^^^l^lo California, OSM supplied 
Laboratories ..miriHDER™ RACE Kit", which was 

with its product described above, 

ueed on a fraction of ^ '//"^^-^^^^.........on of the 3' 

/irHsJ-" retroviral genome encoBpassin, the 
region of the HSRV re carried out to 

region of the •■env" gene, a stu,^ ^ 
„ determine a consensus ^^--^J^^^Z^Js retrovirus 
:Z.TZ - «SRV-1 retrovirus 

displays partial homologies ^^^^ 

The same specific 3 pri» ^ 

-"r\rn.rssrreisrsriowr 

synthesis Of -^^-^--r^the abL pri»er 

bidirectional . -^Jf^^^tr-ordin, to the method 

^ar-T-ied out m one si^ep 
20 were carriea 

described in ^""^ ^;*-:;r;,;:L„g from the PCR were 
The products oriq i according to 

extracted after P"f 'f^^^ „ %r re Jspended in 10 ml 
conventional methods (17), properties of Tag 

„ of distilled water. ;ldenine at the 3- end of 

polymerase consists in addl g ^^^^^^ ^^^^^^ 

each of the two DNA ""'"^^^ '/^^ °„ cloning™ .it (British 

directly into a P""/^ "^^rsolution were mixed with 5 
Biotechnology) . The 2 ml of DHA^^^ ^ ^ ^^.^^.^ 

30 ml of sterile „x„x LIGATIOH BUFFER-, 2 ml 

concentrated ligation buffer ughSE". 

c, ..pch™ vector- <"^- : J"Ver„ight at la-C. The 
This mixture was ^"""^^""^ .ocording to the 

following steps were ^.^ ^^^^^^^^ siotechno- 

instructions of the ta ^.^ colonies of 

logy) . At the end of the procedure, 



35 

logy) 
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/«hite^ were picked out in order to 
reco»a,inant bacterxa (wh.te) J p.^^^ias 

,e cultured and to per.xt -^"^^^ ^^^^^^ ..„,„ip,ep" 

Pxas»id preparation fro» eacn 
procedure (17). T ^ ^ suitable restriction 

, recombinant colony was cut wi possessing an 

enzyme and analysed on agaro- ^ - ^ J^,^ ,,,, 
insert detected under UV ^^^^^J^^^^^ sequencing of the 
ethidiutn bromide were -^^'^^'^^ ^^^\,,p,ementary to 
insert, after ^V-i^i-t-n w.t. a prx- ^^^^^^^ 

° .ne S.e j^^-^-::^Z pAor to sequencing was tben 
Cloning^" Kit. The rea ^ecoxamended for the use 

performed according ...^tion Kit dye 

of the sequencing Xit (Applied Biosystems, 

deoxy termina uw-- 

cycle sequence. ^.^ . ^^ carried out 

with an Applied Biosystems „.anuf acturer • s 

373 A" apparatus according to 

instructions. approach was applied to a sample 

,nis below from a mixture 

,0 of virion concentrated as des ^ ^^^^.^.....oid lines 

of culture supernatants P^"'*" established from 

such as are described ^^ J^^^^J; possessing 
lymphocytes of patients suffering ^^^^^ ^^^^^^^^^^ 

reverse transcriptase ""^^"'J^^^ perron et al. (3) : 
according to the technique weekly, 
the culture supernatants ^i^^tes to remove cell 

precentrifuged at ^0'0°°;^-™_;;V:/:3e. as they are for 
debris and then frozen a ^^^^^^ supernatants are 

the following ^^^P^- J^^ giycerol-PBS at 100,000 g 

30 centrifuged on a cushion of 30 9 3^pernatant, the 

.or 2 h at .«C. sample of concentrated 

sedimented pellet .hereby obtained is 

but unpurified ^n appropriate buffer 

then taKen up in a -^^^^^^ synthesis reaction 

35 for the extraction of RNA. 
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. „.„led out on this 
„e„tloned above is carrxed 

concentrated extr.=ellul« - ^^^^^.^^ ^^^,,ue 
RT-PCR •'»Pl^"°'" ^i<,„^ m3 to be obtained, 

^- above enabled tne ^-..-^ is presented 

Toentioned above ^, gEQ ID NO: 46, xs pr 

Whose sequence, xdentxfxed 

in Figure 13. sequence homology between the 

in Figure 14, the ^^"^^^^^.^^^ ghown on the 

clone FBd3 and the HSERV-9 re ^^^^^^^^ 
matrix Chart by a -"^-7;^^^",, .e seen that there 

, greater than or equal to 65%^ ,,,„e (with 

Le homologies in the f lanKing r g ^^^^ 
the pol gene at the - -^J-^J^ ...ernal region is 

the LTR at the 3' ^^^^^ any homology, even 

totally divergent and does no ^.^^^ermore, it is 

.3 weaK, With the "env" gene - "^^^^^^^^ , ,.„,er "env" 
apparent that the ^^^^^'^ for the defective 

region than the one whxch x ^^^^ internal 

endogenous HSKRV-9; ^^^^^^f .^.^ert" between the 
divergent region the HSERV-9 defective 

20 regions of partial homology 
genes . 

rr"; »T"s;r ^bUovx^. .o«.» 

OF THE REGION OF i""^ 
25 BETWBEK THE CLONES MCri7 AMD F ^^^e 
rour oligonucleotides , Fl , ,,„,entrated 

aefined for amplifying ^^ ^^^"^^^^^^^^ control reactions 

virions of the straxns P0L2 an^^^ ^^^^^^^^ ,f 

were performed so as amplification 

30 contaminants (reaction wxth ...^rding to the 
consists of a first ^^^f J,,,,,ion EP-A-0 , 569 , 272 , 

protocol described xn Patent App ,0 ml of 

followed by a second step to the 

product Of the first steP w^t^^ P ,,.pc« 

35 amplified first regxon ^ nes ed ^^^^ PCR 
cycle, the primers Fl and 



PCT/IB97/01482 

WO 98/23755 



56 



ni are used . The 
^ele. t.e primers .e a„a the pr.«r B. 
:^t;= ar. positionea as follows. 



Fl F6 



Their ticGGCATACTCACTG (SEQ ID NO:47) 

The product of "^^^^^ pigure 15, and 

■ "t pel" is presented 

,0 corresponds to the seque 

.»,T«r NEW SEQUENCES, EXPRESSED AS 

- TM CULTURE PRODUCING MSRV 1/ 
IN CELLS IN C"^^"'' RETROVIRAL GENOME 

^ nenv" REGION OP THE ^^.^ed according to the 

A library of cDNA was P'^^ ^he "cDNA 

" procedure described .V - X^or U.-^n module, 
_»,4iilB cOB* rapid aaapi- vitro 
synthesis module, >- lambda gtio in 

I»X rapid olonln, .odul. ani ^V/Z, l<roi712, 

TacKa^m, module" '^^^f^^^^^^Jl^^^^.r ™. extrected 
30 RPH1713, RPH1717, ^"^.lasroid line such as Is described 
„o„ cells of a B ^^P''""';" „„ a,e lymphocytes of a 
l„ Example 2. established fr ^^^g reverse 

suffering J ..cordln, to 

,3 rrciZ: -rlJd by perron at al. .> . 
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clcnea i„« ;,7i;r^. <x.™. region o. 

Cloned into ^^^^^^'//J, pol ee^ences to the 

„3HV-1 '=>nP-nv:;J\:e%..„eent.tiv, of tne ee^ences 

" o.tainea in the ceUular ^'^-^J^J'-^ „ „o=S. Is P«- 

TT ni-n whose sequence 

- the clone JLBcl, wnuc. 

sented in Figure 16; 33 pre- 

TTno9 Whose sequence Sii>y 

- the clone JLBc2, wnut. 

.„„oxo,ous to tnat Of X'ee: t.. clone ^oX 

Fiaures 18 end 19. The hoBOlo9y ■> 

i rthe Clone O^BC. le .hov™ ™- -^ _ ^^^^ 
.ne ho.olo,ies he^een^^t ^^^^^^^ 

JLB02 on the one hand ana t figures 21 and 22. 

hana are presentea, -'-"^^J'^^^ \^ ,„icn of homology 
It »ill he noted that tn sequence 

between ^Bl, and ™« -"'f; J^^; , aaaitlonal 

" h^e^t' rTn-erted", In the HSE«V-, env sequence, 

sequence absent ( msei. 

,s described m ^aaple '^ „cted that the cloned •■pol" 
It Will also o a<,^e not posses, a 

„,ion is very ^'^''^"^ ^^"Z sequence errors induced 
,„ readin, fra«. (bearing in «na ^^^^ 

by the technKIues used, ^quenoes obtained 

7e^encer, and diverges '""J-^^^^f,,,, these sequences 
„o. virions. In view ofJ:h ^^^^^ .^..ssing MSRV-1 
„ere cloned from the 
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or-« likely for the fact that the pol and 
this is all the — ^^^^ is Clearly not 

TMllZ SO- Of these ERVS elements 

roIseT functional LTKs 1; J^^ J/ ./tetT^^^^^^^^ 
.eplicative viruses co-^^^^^^^ the relationship 

transactivators. Under ^^-^ ^^^^^ p.^^able the 

between MSRV-1 and HSK ^^^^^^^^^^ endogenous 

transactivation of the defect j identical, MSRV-1 

ERV9 elements by homologous, or even 
0 transactivating proteins. interfer- 

Such a P--™-°';;\^/::r-l and the related 
ence between ^ ^pres^on ^^^^^^ .^ads 
endogenous elements. Such ^\ expression, some 

to a so-called the MSKV-1- infected 

. features of which ^ - ^ ,oes not 

cultures --;^^-^;/:^^nxpres ion of polypeptides, or even 
lack generation of the exp ^^^^^ 

of endogenous retrovir system, such a scheme 

necessarily tolerated -^^^l^ZJl^^r^.n.. related to 

20 of aberrant °/ ""^e! is liable tc multiply the 

MSRV-1 and induced by the latter the 

aberrant antigens, an observed in 

induction of autoimmune processes such 

MS. essential to note that the 

It is, however, essentxax ^^^^ 
25 ^ TT.BC2 differ from the ERVS 

clones JLBcl and JLBc2 a possess a longer 

seguence already ---^-^;/";;di,ionaI region totally 
env region ^^"'P"^""!,^^^^ ^^^s^ip with the endogenous 
divergent from ERV9. '^^^^^^^f^^^ ^ ^,t they clearly 
ERV9 family may hence ^^^^^^ described. In 

constitute novel ,,n,s of nucleic acid 

effect, interrogation of the da 
seguences mailable in vers n ^Ho^^^^^^ ^^^^ 

"nir: :::rhloCus seg^ence in the env region of 
" Ts c loneTto be identified. 
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EXAMPLE lO: OBTAINIKO SEQnEKCBS LOCATED IN THE 

ool AND 3. gag region of the msrv-i retroviral genome 
^ a! has already been described in Example 5, a 

3 PCK technC derived fro. the technic^e published by 
5 PCR technxq technique derived makes it 

Frohman (19) was usea. 

usinq a specific primer at the 3 ena 
possible, using a p ^e the sequence towards 

:;ore i j ... .^n.. 
"°r: rrtr^o ret Ki.., 

Zrls ..action o. vi.ion puri.ie. as 

described above. i f ication of the 5' 

Tn order to carry out an amplification o 
. . MSRV-1 retroviral genome starting from the 

region of the MSRV 1 re extending 

pox sequence already ^^^^^^^^^^^^^^^^^^^ ,.,.ers were 

towards the gag gene, MSRV 1 sp 

defined. ^^^^ ^1,^ „it protocol 

The the PCR amplification 

for the synthesis of the cDNA following MSRV-l 

are, respectively, complementary to the 

CCTGAGTTCTTGCACTAACCC 
amplification: (SEQ ID NO: 55) 
GTCCGTTGGGTTTCCTTACTCCT 

..e products o^^^---lror;er c^^^^^^^^^ - 
extracted J^f ;:;;-::\r resulpen^ in 10 ml 

30 conventional methods (i ) . properties of Tag 

of distilled water. Since o ^^^^^^^ 

polymerase consists in obtained was inserted 

each of the two ^^J^^^^^^^^^ Kit (British 

directly into a P^-^^/^* ^^^^ ^^.^^ion were mixed with 5 
35 Biotechnology) • The 2 ml of DNA sol 
ral of sterile distilled water, l 
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. ,f.r "lOX LIGATION BOFFER", 2 -l 
concentrated U,.tion buffer J 

„p,«TK VECTOR.. "^^^J" „,,„l,nt at = . 

T^is mixture was '^^^ according to the 

.ollo-in. steps '""/",;"%ic„in,. <»""^ 
5 instructions of the TA ^a^^, the »hrte 

Biotechnolow) . ^^'-^'^^J,,, ,„hite, were plcXed out 
colonies of ""' to per»lt extraction of the 

in order to he cultured '° J ^„ ^he so-called 
Tlasnlds incorporated acc^din,^^^^^ preparation fro„ 
10 .„iniprep.. procedure (17). Th P^^^ ^.^^ ^ 

.ach reco.hinant -^^^ "^"^ „,.„ee .el. Plaemids 
restriction enzyme and analys ^ ^^^^^ 

po.ses.in, an l-er' det-te ^^^^ ..ieoted for 

'stainln, the ,el ^^^^ ^^^'^..r hyhridi.ation with a 
ee-iuencin, ct the rnsert the 
p^„er ccplementary "^^.^J Kit. The reaction prror 
clonln, Plas^id of the ™ .„„„,aln, to the method 

^4«rr was then performed ..prism ready 

to sequencxng was ^equencxng Kit Prx 

recommended for the use ° ^ sequencing kit 

v4t dye deoxyterminator j automatic 
20 reaction kit aye 401384) , and a 

(Applied Biosystems, ref. ^ ^^^^ siosystems 

i:::encing was -riea jut with^ ^^^^^^^^^ 
..automatic sequencer 

the manufacturer's ^^^^^^^^^^^^^^^ „as applied to a sample 
This technical -PP//;^ ^....^ from a mixture 
of virion concentrated -J-^f 3 ly^pho^lastoid lines 
of culture supernatants P-<^-^^ established from 

as are described m ^^^"^V and possessing 

of patients suffering from M ^^^ectable 
lymphocytes of P ^^tivity which is a 

30 reverse transcriptase .j,^^ by Perron et al. (3)- 

" according to the technique twice weekly, 

tne culture -P^^^^f ^ "^or 30 minutes to remove cell 
^recentrifuged at 10,000 rpm for ^^^^ ^^^^ 

Tebris and then ^^^^-^j\;Z ^r thawed -P— "^^.^q 
33 the -Uowing steps. - -s^^ ^^^^^^^...^S at 100,000 g 

centrifuged on a cu 
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f the supernatant, the 
, h at 4»C. After removal of concentrated 

unpurified virions. The P appropriate buffer 

then taxen up in a ,,,, synthesis react- 

3 for the -.ractxon^of^--^^^ ^^^^ ....acted 

,0 Whose sequence, identx 

in Figure 23. hoiaology between 

in Figure 24, the ^ shown on the 

^ *-he HSERV-9 retrovirus partial 
GMP3 and the noi^ une. for any 

clone enr continuous line, 



15 



20 



chart y ^„ 55,. ^,^tion of 

a^ve t.e point - ^^^.^ material 

on the unaerstanaxn, that t^^^^^^^^ „ 

« seauence and size va 26 shows the 

position of dxf ^ 
S region. . ^^^^ ^lone GM3 described abov . 

By means of the ^ defined, covering the 

possible reading fra.e cou d - ^ S.O XO 

Lie of - J-eCcessive Figures 2.a to 27c. 
„0:57, shown in the ^..^^.V-i BPBCXFXC 

^XBODXES IK HUMAN 8BRUH pol gene 

^ Identification of the J ^^^^.^^ of 

1 retrovirus and of an v ^ 
of the amino acid sequence SEQ 

35 this gene enabled the 
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. referenced SEQ ID N0:40. to be 
a region of tne said .ene refer 

determined (see F.gure 28) . eorrespondxng to 

Different synthetic P P ^^^^^se 

.rag^ents of the P--^\J^^;r;ene were tested for 
3 trarscriptase encoded by t,e pc, ^ .^^^^^^^ 

their antigenic ^P^^"^ Wealthy controls, 

patients suffering fro. MS a-i ^^^^^^^ chemically by 
^ The peptides were sy ^errifield tech- 

3olid.p.ase synthesis —^^^^ ,,30, Xn the 

- rr l!;s.:" CroT K and Heienhofer . 

peptides, 2 1 practical details 

Academic Press, New 

described below. 

a) peptide synthesis. ^ phenylacet- 

The peptides were ^^nthe-^^^^^^ 
^Hvl (PAM) /polystyrene/diviny 

amidomethyl ^ City. CA) , " y 

,<-4-£>Tne:. inc* . fpKo amine 



amiaofti"*-"^- poster ^x^^' . 

(applied Bi-V=*-=';;r;«to..tic synthesizer. The a»xno 
-Applied Blosysteme " „f hydroxyUeniotti^ole 

":.r"t -rJri: .ed 



25 



(BuMndorf, 5"^'^"'^'"'^ ' ,„„thesU was performed using a 
The chemical ^''""^'"^^thyipyrroUdone (»MP) as 

,ou.le coupling P"-;y;':.: :;rjr resm. as well as 
eolvent. The P^P'"- "f;^ ^^^.p,. simultaneously, us.ng 
slde-chaln P"'"";; ^ ,„\,,.ie apparatus (type X 
hydrofluoric acid (BF) l ,„,,,t„te, Osaka, Japan). 
Cleavage apparatus. "P^^^ \"t:,\„, ',0 ml of 1 -1 »^ 

j,„ 1 g of peptldyl "^^ ■ „e used. The 

anlsole and 1 ml =f ^^-^'.^^^f.r-.-c. The H. Is then 
Tlxt^e IS stirred for 4= --";„'\„,,^ive washes with 
'evaporated off under with 10* 

. the peptide is ©J-^^ 

tril add and ^P^t'^u^^ed hy preparative hl^ 

^\C^:ZrZir.^^ on a vvo.c CIS type 
performance liqu^" 
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litored by an elutio- under i ^ „^ 

CIc. analytiol Jt/eL "Mention ti.e «e 

TrUpon-rant fraction is «.en 
^led and W'P"'^",'";. ™° J,h performance 1^^* 
.n.W.ed M 'I'^'^^e sy=«B described 
chro»a«grapnv J^^^,/ „ be of — P""' H^t, 

pep^ae ».1=H^^U represenUn, not 

TT^^^-^^^ are »en analyad wUb the 

purified P^^"^;;. ^f.,,, coBposition, us.n, 
Object of »onitorin, tb- ,..n^^^^ ^^^^^ ^^^^ 

Riosvstems 420H au^ molecular mass ot 

' ;:.r :- ortbe -vera,. cb«..cai .o.e^^^^^^_^^^^^ „ 

XTeptidee is obtained """^ aouble focusxn, 

Te POsi"ve ion on ajo. J ^^^^ ^^^^^^^,„„ 

instrument connects _ . 

,^ (VC analytical Ltd, "» different peptia 

" reactivity of ^^^^ 

- :c:r; ^:.e w o. .... 

• b) Antigenic P-P^^^^^^^ the P0L2B peptide 

The antigenic propertxes ^^^^^ p^^^^,,, 

o^-rated according ^° 
«pre demonstratea 
30 were « evolved in 

— r".Pbiii.ed -::rarn Ti ..Mi; 

r":r:rontra::.:tea and .apt 

Thxs stocK 8 ^^^^^ .20 C r buffered 

35 over a ^^^^-^^^^^ ° ^ ,uuted in PBS (P^o^P^^^^ 
xaonths. An aliquot 
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obtain a final peptide 

«.line) solution so as t: ^icrolitres of this 

concenUion of X --^^^^^^^i^ ' .icrotitration plates 

! on are placed in each well of ^^^^^ 

dilution are F roSTAR ref: 359U) • 

„^ir>a" plastxc, co&ift ,^v,osive and KepT^ 

^,L.s o£ a solution A (" PBS. drained 

r/tea over an a^— rpe^^-" ' 
I filled with ^O" J, then covered »lth an 

' . + 10% of 9oat seru»), th^ 

^•^^--Ther;t-eer...a.lee 

in the solution B and 100 - ^^^^ 
.'est seru» are P^'"-* "^i^ced in one well of each 

rr^^r:rr .iLoiitres o.^^^^^^^^ 

" ;ire; -veted with an ^^^-^"Jl^ lL. three ti.es 
t , hours at 37.C. The P""= 'J^^^ xn parallel, a 

1th the solution A '^^"'''"'^ directed against human 
«oxltrse-labelled goat antihody 
r. ?sig.a I»unoche»icals """^dilution 1/5000 for 

" sf» rts diluted in the "'^Z^lU ■ --""'"^ 
Te an*i-MS and 1/1000 for the ant. J ^^^^^^ 

Tthe appropriate ^i^"""" ^'^otitration Plates, an^ 
^en placed in each well "'^^ i„cubated for 1 to 

, ».s covered with an adhesiv ^^^„ 
30 P"*^, A further washing of the P ^^.^^^ 

^ ""''"d as d scribed above, m P""^^'^^:,:::,! of tne 
performed as » according to the dire 

""""VasroP K^. (Bigma X-no-mic.ls^ - 

,3 ;r:icroUtres o* substrate solution 
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are pla«^ protected 
well, ana '^"^/'^ „o™ temperature. ^^^^uued, «>e 

,„ to .0 """^^J^ ;„,our reaction has at ^^^^ 

Tp"«^ optlcaraeneity (00) <>' 

plates are Pl „a the optioa 

,pe=trcphot«.etr- ^ ,„ele„,th o* * ^^^^ 

natxvely, tion. and the P 

to stop the 2^ ^^^^rs. ^a^ced in d^plx- 

> ' ttipXlcate, and the ^ed by "km, 

" ;l to tne Beru» e,Be sa.ple at 

correepondlng to^_^ ^^^^^ „^„l,,d tor th 

the nean ol „eai> 

- TrZ'o. oe eaoh seru. — C-e control 

" n;: ^^^^^^^^^^ - 

— .tectlon o. ant.-»s. ^^^^ ^^^^ 

,he technique described .bov»_^^^ -"^^-^r^t 
" Tho^Usttrel^o. .. pa--- 

--"^"XrrP^obable and o. » 

whom a 'i'"-" „ the criteria ot P 
llshed according 

„ health control, (b "'""J J° vertical bar 

tested wi'^" tlcal density (00 ^t the 

..presents the a.U gives t^^"^^ 

serut. tested. Th , . represent the sera 

30 °' ^'-t/trre vertical bro.en ^-^^ai ^ars lying 

to the left of th 32 ver ^^^^^^ 

„ cases of MS ^^^^^^^ ^^^.n Una 

°' »ean of the jet 00 v^^^ ^ to be 

tested is O.e.. 
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*-he qrouped values of 

^ -FoT- the controls, 
The mean ot th« -et ° ^^^^^^^ o.36. 

incluam, the con«clB "^^J^';^; .,,ues are greater 
Without the 5 controls whose net „ controls 

Tn or eoual to » ^^r^^^fthe negative controXe 

- r.o Tt;rroarrre.ho. ^ 

t:.:;.t.a accorain, to the fo^^^ the 
threshold value ° , , standard deviation of 

15 seronegative «'>«°^=' ,„on.gative control.) ■ 

the net OD values of the .« g „„sidered to be 

in the first case, ^^reshold value is 

sy»pto-.X fo.S3. The negative results 

e^al to o.» * '^J,;,-",L.Kground.. of the Presence 

peptide. ^ if the set of controls 

P I, the second case, ^^^^^ ^^^^ ,s 

eonsistin. -j-teTsTs;:^ ^r^nr^ 

25 taken as a referenc seropositive, the bx. 

Which are, on the ° J^;,,.. 0.1x6. The threshold 

deviation of the "non-MS con ^ ^^^^^ 

tllue then .eco.es 0 • - ^J^^^ J, ,,3 test is specif.c 
According to this ^^^^ ^^^^ ^^^^ ^est xs 

* ^ -MS in this respect, it control 
30 for MS. -L" shown m Tacie 

MS since, as »" -,»^4. this result 

rrn^t^ »"ahove this t.es.1^^^^^ ^„ 
reflects the fact that ^ than 
,„ffering - -JV-U in contact with MB.V-1. 
35 healthy controls wno 
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TABLE 



NO. 1 



10 



15 



20 



25 



30 



MS 

0.681 

1.0425 

0.5675 

0.63 

0.588 

0.645 

0.6635 

0.576 

0.7765 

0.5745 

0.513 

0.4325 

0.7255 

0.859 

0.6435 

0.5795 

0.8655 

0.671 

0.596 

0.662 

0.602 

0.525 

0.53 

0.565 

0.517 

0.607 

0.3705 

0.397 

0.4395 



CXDNTROLS 

0.3515 

0 .56 

0.3565 

0 .449 

0.2825 

0.55 

0.52 

0.2535 

0.55 

0.51 

0.426 

0.451 

0.227 

0.3905 

0.265 

0.4295 

0.291 

0.347 

0.4495 

0.3725 

0.181 

0.2725 

0.426 

0.1915 

0.222 

0.395 

0.34 

0.307 

0.219 

0.491 

0.2265 

0.2605 



35 



MEAN 0.62 
STD DEV 0.14 
THRESHOLD VALUE 



0.33 
0.10 
0.53 
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•^h the first method of calcula- 
in accordance with th corresponding 

.ion, and as shown ^ ^^^'^j:^ positive result (net 
arable 1, 26 of the 29 MS sera g .^^.^^^.^^ presence 
OD greater than or equal poL2B peptxde, 

Of xgos specifically directed aj-"^^^^^^^ ,«„.criptase 

hence against a ^-^^^^^olns encoded by its pol gene, 
enzyme of the HSKV-l -^^^^f ^3,,., retrovirus. Thus- 
consequently J\,,,ents tested have reacted 

approximately 90% of the MS P ^^^^^ ^^^^^^^ 

against an epxtope ^^^t the latter. 

pLsess circulating ^^^^ ^^^^ -nors in apparent good 
Five out of 32 13 apparent that 

approximately 15« «oitope carried by the 

. „«4-act with an epn^oF , ^ 4.« an active 

...cine seru. X,0S- -""r.^jriva- 



20 



speciiJ-^ MSRV-i i.^--- - ^.„,_ 

an immunization agaxnst th ^^^^^^^ ^eactiva 

transcriptase during an ^j,3ence of apparent 

Tion Of ) the HS.V-1 retrovirus^ T.e^^ ^^^^^ 3eropositive 
neurological pathology recalling M ^^^^.^^^ 

\o «av indicate that they are immunizing 
controls may m . .^^^^ous virus after 



25 

of 



30 



of chronic carrier present in the envi 

retrovirus constitutes aU or P ,^ 

a,ent» at the source of «S. „ ' i 

i« taken from a healtny ^ ^ ^^e MSRV-1 

retrovirus. Thus, the di 
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. e is extremely significant: 
,ne MS an. control POP^^^^^^^ ,,,3, results hence point 
"chi-squared" test, P msrV-1 in MS. 

„ .etiop..hc,e„l= role » "^^^^^.^ ,„Uboales W 

d) Detection ox 

ELISA: ^ „ith the P0L2B peptide was 

The ELISA technxque wx specific 

used to test for the P--^ for who. a definite 

antibodies in the — f J,^ established according to 
or probable ^ia^"-^^ / of 42 healthy controls 

w ^r-iteria of Poser (23), 
) the crix-"-*-** 

(blood donors) . each serum tssts 

< ngure « shows the res ^^^^^^ represents 

,,,, „ antl-I9M antibody^ Bach ^ , 

«,e net optical oD at the top «»e 

ordinate axis ,xves t^e ^^^^^^ ^^^^ ^ 

vertical bars. '^"^ ,„«ln, the 

,.ft of the vertical Una tested, and the 

trrasent the sera o. 3. cases o. 

vertical bars Ivlh^ " '^"^ "'^^ ^,,itny controls (hlood 
,0 line represent ^-^ l^'^^^^ in the middle of t^e 

donors) . TOe ''leal threshold defining the 

diagra. represents a '^"eoreti the 
t^darv Of the PO^-- .Uults cm wmch the top 

ber lies above) and the neg 
„ the bar lies bel».) - ^ ^^^^^^ the HS cases 

The mean of tne 
" r »e,n Of the net 00 values for the controls 

" ^= - - t"ro:::rrre.rop 

35 the formula: 
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of the net OD values of 
threshold value ' (mean ^ ^^^^^^^^ ^^^.^^.^^ 

nTorrs :Lone,aUve con.rols>. 

,3 X 0.05) . 0.2e; or, in PJ--^^^^^^^^ a non-specific 

^ The negative results rep ^^^^.^^ directed 

4= *-he presence or 
«background" of ^ ^^.tope of the peptide, 
specif ically against an ep^^tope 

According to this an y ^^^^ 
.0 and in the corresponding Tabi . ^^^^ 
Figure 30 ana j.» f^ontrol has a ne-c 

L ,pecl£i= tor MS, «n=e no cont ^ ^^^^^^^^ 

.hreshoia. 7 Of tne 36 HS sera P ^^^^^^^ 
.esul., now, a s.uay °^,„, , „„t a«.cK of KS 

a>e.e po.i-:ive sera -re "l^^" „ i. Known that 

„ an acute a«a=K untreate^ P 

alrectea -^^^jtlrin, reacUva.ione foXXow- 
durlnq primary "*"="-° pathogenic agent. 

a latency phaae of J^'^^^^^.^aience between the MS 
The difference in serop algnificant: 
and control populations le 

„„,.l.svared" test, P < aetiopathcenic role 

These results point 

- •^''^-^ ;rdet^- Of x,M and x.<= ;---/r„r- 

" the PC. ::^\Te.=or If MS.V.1 to . 

tion and/ or of the 
evaluated. 
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and 
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TABLE NO. 2 



CONTROLS 
0.064 0.243 
0-^98 

5 0 115 0.028 

0.025 

0-097 0.176 
0 108 0.146 

10 

5.314 0.093 
15 0.522 0-"' 

0-306 S-n?,. 

'All o-.lll 
o-.iu 

0.033 O.llB 
°0-187 



0 



35 0.0-7 



0 .04 



973 O.u/J 
0.008 
0.074 

40 0-^^^ 

0,219 

0 .047 

0.017 



0.09 
0.05 
0.26 
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sea.c. .o. .«»uno— epitopes in tne 

P0L2B peptide: non-specific ^-^J^;^^ 

in order to ^f"" responses of the 

ana to optimize tne --^^^"^^^...^is of oc.apeptides 
5 anti-MSRV-1 antibodxes, the J ^^^^^^ the 

r olna in successive one ^as carried out 

advancing in determined by PO^^B, wa 

whole of the sequence 5^^^^ below, 

according to the ^^^-^^^''^^Z of overlapping octapep- 
The chemical synthesis ^^^^ 

,0 Uaes covering «.e ..mo .=1^ -u- e . 

fetlulcse accord-, " -arKeted by 

!l (X"9- J- under the trade neme 

tbiia^e .esearcn ^^^^'^^J^ «,e e..u.tanecas 

Spctecan. ™is peptides and thexr 

synthesis ol a lar9 ^^^^ 

«.e synthesis Is ^"^^'tC ^^^^^ro^^ 
adds m Which the a-a-n° ;-^°J ^,,,.,,.,„ ^cups 
r woe ^oup (Kova Broche») and^ ^^^^^^ 
"th protective groups such as tr y ,,,„biuzed xn 

"^tyl ether. The -""^f^ a concentration o. .00 -. 
„.„ethylpyrroUdone (»«« • of hroBophencl 

and 0.9 Bl are applied to spots ^^^^^^^ ^ 

hlue. ^«.r carried out according to 

application of a,lno acids iS 

t'other "^"'^Z.o" correctly, a color.txon 

»lno acids has -.enj^a^ ,eUow-green, is 

^Ulcatlon (Change fro. ^ ,eetyl.tion step 

„ ohserved. After three -she' ^ , ^^^^^ ,,^.„al 
is performed with acetic anhy ^^^^^ synthesis 

::ino groups o. m ^he spot. Of 

deprotected with 20% pyr «:c«ophenol 
depo.lt are restained -th • 1* ,,.ed. 

• n»i7 washed three times wi addition 
Ts t^tTf' o;eratlons constitutes on. cycle 
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35 
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is repeated until the 
an a.ino acid, .his cycle ^^^^^ ^^^^^ ^^^^ 

synthesis is complete. When all ^^^^ ^^^^^ 

added, the im2-tert«.nal group .eetylated wxth 

aetroUcted with 20% P^^-^^^^^^^Ji,, the side chain are 
:::Lc anhydride. The ---^^f^^^lLthane/^ 

.e.oved with a ^ .1) 
acid/triisobutylsilane (5 ml/ ^^^^^^ eLISA. 

oi-ivitv of the peptides is ^^i-aoeptides in 

inoftunoreactivity o different octapepi^i 

After synthesis of the ^^^^^^ ,,,3 

duplicate on two <^^"-^"; "bs 0 • ^« " ' 

tfnsed with methanol and wash m (^^^^^^^^^^^ 

then incubated j',,,! washes in TBS-T (O.lM 
. . i..,*'fer. After sevBi. incubated 



15 



then in^—— several wasnes — 

saturation buffer. membrane is incubated 

.ris PH 7.2 - 0.05% - 3eru. originating from 

with a 1/50 <ii^-^-" °'^^\3' the other .e^brane wxth a 

a patient suffering from MS an ^^^^^^^ ^^^^^^^^^ 

,/SO dilution of a pool °^ -'^^^^^^ ..o. temperature. 
membranes are incubated for ^ ^^^^,ia,3e-labelled antx- 
^,ter washes with ^^-'^ ' jj^^ ^.arKeted by Cambridge 
i.u»an immunoglobulin conjugate ( ^^^^^^^^ 



;^arch Bicche^icals, U added « • - 

.^e .ixture U i--*^^ J:^„„,, „.o« ™s- 

te:.perature. &«er -""es =£ th ^,j£^„ent spots is 

Tana PBS. the l»-"--=";;'i;.:.e.loro-3-lnd=lyl f-"" 

galactopyranoside in P« ^alitatlvely «ith a 

coloration of the spots is estl 
„l.tive value t.on 0 

Figures 31 to 33. possible to determine two 

in this way, it is P ^^j^^^ peptide, 

" immunodominant '=^''"^^.^''^1^. amino .old se^ences 

correspond!.,, "^^^"^^'1;.,,, (SEQ ID NO=42, , acoordin, 
,5-75 (SK ID H0:41) and 92 10 I the 

" ngure 34, and V-^^„;:^^„,.p„-«P 
.nd »g-Pro-Mp ser 
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Tnr-Val-Leu Pr (^^GQALAQ) , and ^^^^ 

'^^^-^'"'^"'ve apparently ^^^^^ control seru., 
XesB reactive ^^^g^ound vxth ^^,,.pne-Glu-Asp- 

^^ta%V octapeptides ^;Z-^,^.,..-0..^^^^- 

5 represented By ^(,,43) and P">e 

rtw^rh are more sp 
peptides «hl ^^^^^ techniques. the 

" ,e thus possihie -^^^^^ ^he inven- 

• . Mde and »^ »ethods i„£eotion and/or 

discoveries »»d . „„sis ol «5W ^he Msis 

tors, to carry °"^/^^,,„,te a therapy ^ ^hese 

15 of its -«^;;°' tients. hiolo,.cai „i„, „euro- 

agents in the P ^^^^le not yet dl P , 

early ^-"""""/^s couid it P°--"%7,ective with 

1 vould ^ " „:se " r the fact 

treatment «h;^ .^sequent ciin^-^ JfJ^.^, .^responds 
JO respect to the lesion st.,e « 

that i'^ """^^ neurological disorders. ^^ed 

to the onset °' " , of MS "-°\::,„„e 

a'C;^"-"-" °' "^"'"t : tuted hefore the 

before a SY^^^P treatment is msT. lesions of 

- cUnicaX picture ^ :::,;^,,nificant. 

exaergence of ^^^^^^ ^^^^h are a ^^^^^^^ 

tne central nerv ^^^^^^ „sRV 2 ^nd 

^ne di-^'^°^^^°^^n is hence of ^--^^'-"^Zin^ this. 
--^""rinUTon provides the .ea^ o do.n.^^^ ^ 
30 the ^ possihle, apart fr to 

. 'of MSKV.l infection and/or ^^^^^^^^ 
diagnosis " on the basis or 

i„ate a therapy m these agents 

evaluate a detection of 

..negativing tn 
35 patients' biological 
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i PCR teolmxque aerivea ^^^^ ^^^^ 

published b, «n'»^^'-«"'""r«ta\ extracted fro. a 

,2., was used. Fr» th. the cDHX 

fraction =f virion P-"'^^ '^^^ (SEQ ID «o.64) at 

synthesized usin, = J^-^f ^\^,uned . usin, exp«.dXK 

..-co«.c " 

(antisense) 

. oolY(G) tail was added at 

After purification, a ^ V transferases 

-^VX/ZrZ.X Boenrin.er Mannnei. 

^it" tuarketed by ^^.^ p^tocol. 

according to the manufacturer S P ^^^^^^ ^^^^^ 

An anchoring PCR was 



20 



lollowln, 5' and 3' P'*^"""! ccCCCC (SEQ ID So. 91) 

(sense) , . 



Next 21 senij. 

with the ciir ■ 

^GiTCTCaS «tTCGATAT « ( <„tl..n.e, 

The products o^^^"*""' ™i according to 
purified after purification onja 
30 conventional methods ("K ..f the 

.0 microlitres of ^^'^^^^^f in adding an adenxne 

properties of Ta, strands, the DHA 

It the 3' end of each of the ^^^^ 
Obtained was inserted dire=,txy -^^^^^^ ^ 
3. Clonin,™ .It J-^-^th Of sterile distilled water, 

solution were mixed w 



f t 
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Kn-Ffer "lOX LIGATION 

BOFm", 2 °^ ^ i„cub.t.d overnight .t 12'C. 

LIC^S.". ThU ""^j;", accordin, « the 

5 insttuctione ot the T» „,d„r., the white 

colonies =i "'=°*^""'\ ".'"Ho pe«lt extraction of the 
in order to be cultured and to P s„-called 
lalide --porated .cc^rdl^^^J^ 
10 •«nlP-p- procedure (17) • Th , ..stable 

,,,, recombinant "^""^ 3,aro.e gel. Pla™^^ 

restriction en.y». -^^JJ^^ „v light after 

possessing an insert detect ^^^^ 
staining the gel «ith ,,«idlz.tion «ith a 

„ seguenclng of the insert aft ^^^^^^^ ^„ 

orimer complementary to the Bp6 p ^^^.^ 
Tloning Plasmid of the » ™ \t„raing to the method 
« seguenclng then xit "^i" 

recommended for the u.. J is seguenclng Kit 

vtt dve deoxyterminator jr automatic 
20 reaction Kit oye 401384), and au 

(Appllea Wosystems, ref . ^^^^^^ eiosyste^B 

Uuencing was -"^^ 3,", .pparatus according to 
.Automatic seguencer, ' 

,L manufacturer's instruction^- ,^„i e 

PCR amplific'tion ace from the 

.hove was used on a cDNR y „^,.ies purified 
mentioned above wa infective particles p 

nucleic acids of fractions of in ^^,^„,,u. 

1 e sucrose ^"^-"^■^Z",':! culture supernatants of 
aescrib^l by H. perron ,rom MS, Immortalized 

30 B lymphocytes of a Ji„ B« and expressing 

^ ^ Rarr virus (kbvj =» +-T-anscriptase 

with EPStein-Barr reverse transoi. 

Trovfral P«"=- „ - ai- <3. ^ ""f 

activity as '>'«"''^„rio 11 =nd n. the clone I,BW, 

.etent ^PP"-"- .T^i^^ . S.0 1^ " 
35 whose sequence, Iden 



in Figure 35. 
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..Iblo to define, 
T« clone m.M» " I"'" ,i„„e (=« 

, . =M3 previously sequenc»l » „p„Bentatlve of 

Po"i- °' .«.-nce 

rr.n:nr::e ---- rr.^ 

„ .«lno .cids P ^^^^^ ^ 

Extraction of v .^fiy described below. 

«r.dinq to the method brietxy lyxaphocytes 
extracted according ,,pernatant ,or 

A pool o£ c centrxrug 

v^nts suffering from MS (650 obtained is 

of patients sui ^^^^.^^^^ pelie The 

^ 10,000 g- ^""^ _ PBS/ 10 tnM Mg^-»-2- 

„«rUl ie «e.«a w J^^^ ..en v.«> 

(50 m9/»l> "i""'^ 30 minutes at 46-0. 

then re-extr -stormed with 

"''„ecipi«^- -%t:rof li v 
, , „1 etwnol in the P^^^^^^^, centrUugetion « 
..2 The pellet obtained afte 
Td m Z .LroUtres of ete.Ue D^C ^^^^ 
„.u.penae^- „„pl. IS temperature^ 

35 extracted with 
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and 



of sodium acetate 

p.ecipl.«e. in presence 

, ^ V.V an OD reading 

ethanol. K^^med is quantified by an 

The RNA and the absence of DNA 

, contaminant ^^J^^^^'^ ^eci.ic ..OSA .or tne «SBV 
RTPCR associated wit^ 

genome. ^. ^^.g of cDNA: . primed 

synthesis oi svnthesize a cDNa f'- 

5 B, Of K«A are '° 7; ^,„,,i„, w the 

■th a POIY(OT) °"^°""°'r thesis Hodule- («£ 
0 with a f ' „ t,„^ synthesis n reverse 

instructions o£ the „ai£ioation.= The rev 

l»e, »er=he», J'^ i,e.». of 

transcription is P 

„co»«ended 42-C. p^itied by a a=ubl 

The synthesis P"""" according to the 

" ^traction and a double purlficatio 

^nufacturer.s ^n—f verified by an M^-i 
presence of «,e HSRV 

associated with a sp 
" 9eno»e. .^^^^ „,3tan=e PCR" : i^^^^ ,„.pcK step 

500 n, of ODW "\"*!:„.r (ref. 1681 842))- 
„.pahd «n, -Plate syste., - J,,,. used, 

several P"" ,/t>,e following Pri»«' = 

„ «ong these, the pair ^^^J^ , ,sE« ID »0:66) 

3. p.i»er= -=^="'^":,:ro conditions are as follows. 

The amplif ^^^^^"^ 

- r» eorrondS 



au»^ — 

940c 10 seconds 
5goc 30 seconds 

30 



35 



U u ^ 

5 ninutes; increment of 

,0 cycles, then 20 =VC'es; „ 

•„ oach cycle on the ex y ,.^.r.es of the 
20 -^"''^V'Mrrt Iplification, 2 -""""^^ ,,,„„d 
end of this first v subjected to 

a.pUfloation pr-i-t „ ,,,„e. 

amplification under the 
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.•on« are conducted in a Parkin 
The LD-PCR --^^^^^^ thin-walled microtubes 
.odel 9600 PCR apparatus 

--""^te a.pU.ication ^^^^^..^ZZ 
3 electrophoresis o. ^^J^^^^^^ - pair o. 

Cloning Of the a^P^fie^ f-^--' 

.he PC. product was pur...^^^^^^^^ P ^^^^^^ 

. p.eparative agarose .e^-^^^ ^upplier-s 

(Spin; D. Dutcher) 

instructions. purified solution are 

2 -croUtres of t P^^^^ ^^^^^^^^^ ^ 

^„i„ed up with 50 ng of vec ^^.^^.^ 



supplier's instructions 

Biotechnology)). obtained is isolated by 

recoi^xnant vector ^^^^^^^^ 

transformation of -"P^""* ampicilUn and the 

,0 are selected usln, their res .tan^ ^^^^^ 

lose Of metabolism ''^\J,„„t vector is confirmed 

rpTari/ ----^^ - — ^^"^ 

EcoRl. ^,„_e for all these criteria, 

FBdl3, a positive clon recombinant 

" .as selected. ^ rCpreP ,--- Hit ,ref 

plasmid was performed — J '"^ " ./^.^otlons. 
1„43) according to the suppuer^^^ ^^^^^ ^^^^^^^^ ^ 

sequencing of T^ne ^ .^^p. fs dve terminator 

30 means of the%erKin ^^-J^^^l^^T'ir mLufacturer-s 
Kit (ref. 402119) «<=«''"^^^„„, introduced into 
instructiions. The ^-'-^'^^ttomatic sequencer. The 

. . ^ ^ -t77 or 37 JA » ^t^A out 
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' Xtegy consists in gene val,cing carried out 

sequencing strategy 

J «f the clone FbdlJ. 
on both strands of tne c 
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, « vBdl3 is identified by 
The sequence of the clone FBdlS 

SEQ ID N0:58. ^,„ology between the 

in Figure 37, the s ^ . g^own on the 

„a«lx Chart by . """"^^^'^V «n be that there 

greater th.h or e,ual to 70*^ ^ ,,„e (w.th 

Ir. ho»,olo9ies in ^^^h the env ,e„e a.d then 

..e pol ,e„e at the f J, ,„«r„al region U 

the LTB at th. 3' end), ho-ology, even 

Totally divergent and ^<^''^'J^l\ p>..ther.ore. it is 
„aa., with the env ^ene " «=^^^„,,,„3 . ion,er "env" 
apparent that the '^""''J^'l^,,,^ tor the detective 
.e^lon than the one vh.ch xs de ^^^^ 
enlogenous HSERV-,; it 'hue he ^^^^^^ 

„»„t region constitutes an ^^-9 defective 

divergent ' J ^^^^^^ „ith the HSEEV 

regions oi f° 

genes. . . . ^eauence determines a potential 

This additional sequen ^^^^^^ amino 

orf. designated 0KFB13,vhich 

sequence SEQ ^^''^J of the clone FBdl3 was 

analyzed using the 

SwissProt data banks. ^^^^^^ 

5 giycosylation significant homology 

The protein does not have 
with already Known ^^.^ originates from a 

It is probable ^^^^^^^.^^^al element (ERV) , 

recombination of an -dog^- ^^^^^^^ 
UnKed to the -pUcation of MSRV ^^^^ 

such a phenomenon does ^^^^ endogenous 

t.e expression of P°^VPeptiae- ^^^^^^^^^^ ...crated by 
retroviral proteins which are n expression of 

:::: Im^une system, such a ^-^^^^ ,„,.eed by 

endogenous elements '^-^^^^^ ^he aberrant antigens 

- - to the induction Of 

and hence tends 
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. „ are observed in «=• " 
..tol^une processes such J ^^^^ .^^er^o 

cleerly constitute. » „ .ne data .anXs o 

described, in effect, interr g „ 
:rei= acid s«.e„ces -auabi in_^_ ^^^^.^ d^ 
„, the "Entrez" =0"""° ' .e<raence comprxsxng the 

^ - . -a. - rr hT 

«f a patient suffering ^as used 

fro. plasma of a P ^^^^ °°"''^rcar.ied out 

control plasma ^^^^^^ ,y„thesiB of cDNA was carrxe 
as negative control, in , . .^T) priiaer: 

w following modified oligoia^ ^ no:68) 
With the TTTTTTTTTT TTTT 3' (SEQ 

20 5- GACTCGCTGC AGATCGATTT TT ^^^^^^^ transcriptase 

and Boehringer Expand company. A 

„cordin, to t.e ---Ten'^nrl^ CConteC, u^« 
= performed with the en^y 930c 1 min, 58 c 

PCR was pert 01. 0400 5 mm then 

* -1 lowing conditions: 94 c o ^ and 

following es 

' final reaction volume of 50 .1. 

with a final rea 

rr/r.:::: --r. 

same as for the cDNA) ..semi-nested" PCR was 

A second, ^°-°^''^\^^,teei within the region 
■*.v, a 5' primer locatea under the 

carried out wi^ » J ^ per£or«d unde 

..ready a^plij-*- ^l^^,^, „ those used in 
S sane experlirental 
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""I-rfused ec. the »e»i-nestea .=R: 
. 5° prl»er, ld.ntifl.d by SE(! ID HO= 0 

" "Lli^rlo ID H0=., ana xo «0..,0 are 



r::ura.rp5^;re ::r\ra 

^•"t r pra^^a - t.. .e,.... con«oX. X.is 
not observed for rne f following manner. 

«e -P':^'^^'' 7 J^^^ , solution were 

„sin, t.e T* :;,";\3.illea water, X .1 of a 

mixed with 5 l-l "l butter "XOX LIGATION 

10-told concentrated ligation 

, „f "OCR™ VECTOR" (25 ng/«l) ana i 
,0 BUFFER", 2 fl °' .^.^^ incubated overnight at 

..TA DMA LIGASE" . This -ixture according to 

1..C. The toUowin, ;"P" =^^„„,„,„ (British 

«e instructions ot^ the 

Biotechnology) . pioked out in 

„ column. Of recomhlnant bacteria O-h^ ) ^^^^^^^ ^ 
order to be cultured and to perm .o-called 

plasmids ^--P-^^-^^ plasmid preparation from 

-miniprep" procedure (17). Th P^^^ ^ suitable 

,,,, recombinant "^^^ " ^..^^e gel. Plasmids 

30 restriction enzv»e and .naly^^^^ ^^^^^ ^ ^^^^^ ^^^^^ 
posseesing an insert ^^^^.^^ ^^^^^^^^ 

staining the gel >'xt'> nybridiz.tion with a 

sequencing of the inser^^^ ^„ 

primer <==«P""';""'^^ ,i„„i„, wt™. The reaction prior 

35 Cloning plasmid of ^« ™ ...^amg to the method 

to sequencing was tnen y 
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ir.n kit "PrisTO ready 
„c.»e.a.a .or ..e use ^J^^Zr. .e^encin, KU" 
. vit dye deoxyterminator i automatxc 
reaction l^it aye 401384) , 

^tr,«lied Biosysteias. ref. Biosystems 
- -"-"r Cone o.»inea aes.^. - 

. «f 467 bp Which is 89% hom y ^^g^ 

region of 4f' ^ retrovirus and a region 

region of the ^^^''-IJ^^^\^ pol -gion of KRV 
. ^ 64% homologous 

10 which i» definea. ^ its 

(NO. 1634 to 2856) to D ^^^^ i„ Figure 38 by 

The done rP6 is rep t^ree 

nuclectiae seguence i-"" f ^^one ere inalctea h, 
"rrseX- the nucleotiae sequence. 

AMPLIFICATION or Tnr. „eM3 ahu 

» e. REGION DEFINED BY TB* 

Ollgonucleotlaes --^'^'^^^.^nt .ere ae.lnea 

« — • neu"'":—:! - «-«rg r 

rrprtserrtne ..s^^^^^^^^^^^ -1 L .e 

„s. control reactions were P ^^^^ „,ter) . The 

of contaminants (reacti j<,iio„ea by » 

'"tmcatrln consists of . step of JJO« 
30 "Pl^^^r" of pri.ers "ere aefine 

ov«TaPPin9 regions (aesignatea o, B . ^ 
ZToSttZ. by th. seguences of 
poU aescribea above. 

.ea «T-.C. for amplification of the region 0= 
semi-nested RT 
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"'^""^ SBQ xo »0,,. (sen,.. 

. cFO ID NO:72 (antisense) 

rr«.e sella PCH eycle, the .oXX»in, p..— 

p.i„e. V. SHI 10 »0=73 (sense, 
4: SEQ ID H0.74 

nv'ir'ation Of the region E. 

" 3B« XO »0 = v. (sense, 

. qEO ID NO:76 (antisense) 

rrU' second 

" p.i.« 7-. SEO XD »0:77 (sense, 

• «v. a. SEQ ID NO:78 (antisense) 

XO K0..7, (sense, 

"° pri»er 9= S^Q XD »0 = SX (sense) 

'primer XX: SEQ XO »0=S. <a""-"-'^ , 
The primers end th. regions G, 
.hey aeline «e posi.icned as follows: 
cDNA ^ 
30 i ^ „ 8 6 

5 2 E L__a__-u 10 

> 

>< 

< POL* 

GM3 
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of the region defined by the 
-^-7 J\ ,eter.ined after cloning 

different clones G, E a ^„_iif ication products, 

and sequencing of the -nested a«pl ^^^^^^^^ 

..e Clones O K ana A w^^^ ^^^^^^^ ^ 

PCR with the primers 1 at t ^^.^^^^ 

11 at the 3. end of ^^g^.^p fragment G.E+A 

described above. An "^^^^^^^ ^,,3,^^ using the TA 
was amplified ^ence of the amplification 

Cloning (trademark) Kit. Th ^^^^^^.^^^ analyses 

product „as carried out. The sequence 

. ^ 4-vio recjion 
presented in Figure 40. 

IN THE DNA OF AN MS liXnrxi 

- ™: r was""- on ..e OH. e..»c«. 

A nested POT was P ^ ^^^^^^^ i™„crtaU.ea 

a ,Tb95 as described awve and as 

25 «ith the EBV virus strain . expressina 

the MSRV-1 i^„t ,„„ering froB MS. 

Moca ^y-^l^^^; Tcr step, the fouowm, primers are 

:ri.er 3.1.: ''-'^^"l^rr'rpU.Lt.on cvcles with 
This step comprises 3 p ^^.^ 

U,e foUowlnc, conditions: l mm at 
35 4 min at 72"C. 
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the following pri««'« 
PGR step, ^'^^ 
in the second PCR 

(SEQ ID no: 85) 

are used: ^CAAT CAGCATGTGG ^^.^^^ 

^^^^^^ ScCTCTT GGTCCCT.T ^^^^^^ ^^^^ 

"".is Step - -er. - 

^ .oUowm, condiUons: l 

. . Ifter Purific^ation on ^^^^^ ^^^.p^nded in 1° 
onaT-rhods (17), and --;;;;,:,erties of Ta. 

— r :o..-v:; r^ra 

,5 Biotecnnoiogy) • 'tion butfer". 2 .1 

' ° ted Ugation buffer "^^/f J/ dNA LXGASB" . 
concentrated 1^9 ^^^^ 1 °^ c. The 

This „„e ™ Kit 

„ follcwin, »«P^ „ """ Procedure, the white 

instructions "f P^'*'* ""'^ 

Blotecnnoio^^ » '"". UUion th. 

^" incorporated -<=<»^^^"' " preparation £roa 

„ piasmids in«>^ ""^r Jth a 

.„iniprep- P"'=»'» ,.s cut «i 

.ach recombinant co y ^^^^^ ^„ ,,,„se 9 ^ 

restriction "."^ ,„sert selected 

„as»ids „,.h ethidiu. --^;,::;„n «ith a 

,„ after Bta^n.n, th 9 ""^Cr present on the 

-nXi- "rv to the spe P--« /r,e..ion prior 
pritoer compj-e cloning method 

'cloning pi-^^ ° Ten pertor-ed aoc^^-^f .^le. read, 
„ se<r..ncin^ ""J-^^^ ,he se,uencin. Kit J 
„ reco^ended for the^^^^^^^_^^^^^ ,,,le 

reaction Kit ay 
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(Applied Blosystems, «t. 40X«4), ana automatic 
^^e^^n, was carried out with an Applied Biosy.te^s 
"rol^ic se^encer, .odel 3,3 apparatus acccrdin, to 

..e ^:rrar,natea .™c«l. couxe .a 

' obtained, ana is represented .V it. internal ee^enoe 

4rion<-ified by SEQ ID NO: 60. 

.aentifxed „l„„e is probably representative of 

+-0 FRV-9 present in human DNA, 

and capable of interfering with the "/^^ 

retrovirus, hence capable of having a role in the 
MSRV 1 re „ith the MSRV-1 retrovirus and 

pathogeneses — ^^^^ /^^^ ^ ^p^cific expression in 

capable of servxng as marker ror <* f 
15 the pathology in question. 

BXAHPLB X7: DETECTION OT MTPI-MSRV-l SPECIFIC 
"^"°°^^"a:rof the sequence of the pol gene 
,0 Of the ^ .etrovirus and - ^^^^^^^ ^ Tf 

— rorr:r g- r^^^^^^^^^^^ ... to be 

determined^ synthetic peptides corresponding to 

./of the protein sequence of MSRV-1 reverse 
25 fragments of the P ^^^^ ^^^^^^ 

transcriptase encoded by the p „f 

w . ,n<-iaenic specificity with respecc 
their antigenic p healthy controls. 

t:Cp-re ^ntesl/ed Ce^icaixy 

.Ud-pJe" syrt^esis according to the --^^ 

nioue (22). The practical details 



30 

nique 
below. 



rrp"r synthesi.ed on a phenylacet 
3, .ido^etnyi -^--rCtrcTtrr ua-. » 

(Applied Biosystems, inc. 



10 
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^ The amino 

are coupWd in ^^^^ „^ ""^ .,ch«. 

Novabiochem ^ a 
Switzerland). performed using a 

,„^.e c»upun, r;r..e resin, veU as 

solvent. The P-P""- J^^^^^ ^oups, x 

Bide-ch.in P""''"^^ \ ,,i«ble apparatus (W 
rvaro..uor.c ac^ (» - osa.a, .a^n, 

— ,V.a,. ra^'usea. »e 

„isole »"* 45 .mutes at 2 ^^^^ 

"''^""tel on unaer vacuu.. A«« ^"^^resin witn 
evaporate, on is eluted «o„ the 

id !na then lyopWU"*. -separative hl9h 

Ihe peptides are P-i^ed hy pr P 
<a ohroMtography on » v 
performance iwuid chr ^^^^,<,„ Group, Hesperi . 

column (»0 . ax "> J*\„/„«h an acetonitrx a 

USA). !"r„l/«n. The on a 

at a flow rate °' ' „„,er Isocratio 

monitored by - J'^^\' ,»0 . 4-' time are 

^^c« CXB ana^r^-i^ "^^^^^f.^ u then 

^ ^ ririyrphili-^- The P^P-^^-^/Z^ance U<.uid 

rr- i eyr ^^rrept- pu^:: 

U ^^--n/- .Xrlsentih, not 1... 

30 manitests itse g^am. analysed vlth the 

than 95% ot the Oh ^^^3 ere then „eing 

!« in ^''^ ::r,oid analyser. 

"^^'^"uedM V -s .aOH ""f„fJ":.U:iar -ass o. 
an (average) chemical m 

n ohtained using 

the peptides is 
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..B ZSEQ double focusing 

i«tru»ent connected ^^^^^^ 

— ^'^re\^rcuv«v - .-/.r:;::^ «s an. 

against sera J by a nucleotide 

rjftsiqnated sz'iw Mn.fi3 encoded oy 

trelce c. poX c. HS,V-. (S«. 



,U antigenic prcpert.es 
4-^«ted according ^o 
were demonstrated dissolved in 

described below. ^.^^ide was dis 

The lyopnilJ-ze^ ^ rog/ml- 

.ic acid at a concentration ^^^^ a 

10 % acetic and Kept at 4-4 C ^^^^^ 

3olution was a-.u^^ at -20»C -J -^^rt^ered saline) 
,ortni.nt or /^^JeptTde concentration of 

so !s to Obtain a final ^^.^tion are 

20 solution so as ^^crolitres of t ^^^^^ 

, micrograms .1^ ^ Jered witb a 

"Jn plates. The P^^^^^^^;, ^ hours at 07-0 
.icrotitration P ^^^^^.^^ ^ept f - ^^^^^.^^ 

„plate.sealer ^^n of the P^P^^^" ^.^^.ed three 

25 ^or tne phase of ad P^ ,ates are ^ ^^^^^^^^ ^ 

adhesive - 300 .icroUt-s ^^^^ 

times with a v ^^^^ ^ ^^en i ^j^^i, 

(IX' ^BS, plates thus drained a- 

absorbent tissue- The P^ ^ ^^^^^.^^ , ^" adhesive and 
30 250 microlitres per ^^^^^^^ ' are then washed 

of goat serum), ^n ^^^^es are 

lUate. -^^-Muon K ae r^^^n-n. tc 

-".nU .er» sa.^« ^ ^^utree 
3S i/-0 - tne «eiie c. each -i- 

auute test Beru» are 
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, 1, puoed in on» well 
, A negative w"""'. "„,\t„s ot butter »■ 

- 'tr c:-e:«i- .n .ane^ve are -n^ ^^^^^^ 

response, a P .^eted W '"f ,.,,„tlon . l/".""'' " 

,,einst solution B (^^^"^ tb. 

o« '^^"^ttn a- «U ot tne 

100 -1"°"'" „e then Pl"""' ^"^^ covered ««» «" 
,0 labelled and the pla"= ^ £„rther 

.,o.otltrat.o„ Pl-^^ \ "l-LttU as described 

adhesive are ^hen peri^° prepared 

"-^"%:'pariir- --nets^eu, .^^.^^ 

awe. in par ^^^ions of the 

^ioroUtres of su^^ ^^^^ protected 

„d the «w.rat«r.. stablU^*. 

30 minutes at ro reaction „e 

oT color . <"°«*'^°"'' reaction. The 

,„ -»^«°^^«r^tu ih°^-'r rrt P^te 

n'^s "arr Vaoed i-^-^ „;\cal densitv <oc, of 
t<-rophoto»etric r..«r.^^^„,,, n.^ 

e.- - r.«-- -rtro;:::^^ 

25 . vriplicat^' . •= calculated by ^^'^ 

or in triP^-^ tested is cax*'" -amole at 

^^na to the serum tes^^ ^^„e saropi 

corresponding to ^^^^^^ ^^tained for 

the «ean of t^^ _......onds to the »ean 



corte^f— values w — 

the wean of the „ean 

t s..e dilution- 

The net of ^ne 

used with the 

C-«> - r-hni,ue described a^« ^ „,,„s.V-l 
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in the serum of 15 patients for 
specific igG antibodxes xn th ^^^^.^.^^ed according 

^ ;3):\nd of .eait.V controls 

to the criteria oi 

(Blood donors) . , . th. results tor each serum 

. * rlgure 41 shows the r ^^^^^^^ ^ 

• tested with an -'^'^^/iX' (O" °' ' 

^present, the net optical density ^ ^. 

tested. The t Jt 15 vertical ..rs lym, 

top ol the vertical hars. The £ ^^ ^^^^^^ .^ra 

, to'the le.t o. the v-tica -.en^^^ ^ 

of 16 healthy controls (bio ^^^tioal woken line 

:U lyin, to the r^^f tested. The dl.^» 

^present the sera of 15 caee ^^^^^ the 

enables . controls tc -J^^^'^^^.tlon. These values Bay 
,5 grouped valuee o£ the — ""^ "J.^,, in symptomless 

represent the presence of spe „,iuated in 

'^positive -ral threshold of positivity 

order to determine the 

the test. ^,i,eB for the c^^^'^^"' 

The mean of the net ^^^^^^^ o.l29 

" includin, the r^ro^rwithout the . controls 

and the standard O-""';;"^^^ o.2, the mean of the 

Whose OD values ^'^^ the standard deviation is 

"-^"r rrtU ^l^eshold of po.itivity may 
" rcuJedtccordin, to th, formula: 

Of the net OD values of the 
threshold value (mean of ^ ^^^^^^^^ deviation 
negative controls) + ( ^^^^^ controls) . 

30 of the net OD values of the 

are considered to be 
xn the first case, there ^^^^^^^^^ ^^^^^ 
.^ptomless seropositives and ^^^^^^^^ ,t 

eTuaX to 0.11 . :,-°:Ug.ound.. of the presence 

35 represent a non-specifi 
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ta«n a» » re£«.n=. b«^-. , tne sMndara 

■ on the face of it. threshold 

«hich "''J" „„„„.„s control." « O-^"- 
deviation of the „ oe) - O-'l- . 

value than «=o.es 0 13 * P . ^est is 

According to this la ^^^^ ^^^^ 

, .pacific for HS. Xn thi-eaPect J. ^^^^^ „ , 

i! specific for MS, xn fact, this result 

has a net 00 above this thr . ^ ^,^,,„t, 

reflects the fact "J^, p.rt. higher than in 

„ffarin, fJ^rU in contact 

- -rrcordanca -ith the n«t^-- ^ , 

and as shown in Figure 41 an ^^^^ „^ 

tion, and positive result (OD g 

MS sera give a po presence oi 

« -^^ indicating tne f hence 
equal to 0.2). ^he S24Q peptide, he 

i<=tcally directed against ^^^^.^iptase enzyme of 

^ = m apparent good 

latter. ^ v,xooA donors m aHfc' 

TWO out of 15 blood ^pp^^^^^ that 

..alth Show a positive ^^^^^ population may have 

health sn f the symptomless p f ^^aq 

30 approximately 13 « ° ^^.,,^3 carried by th 

^ « in contact wiT^n to an 

rtide under conditions "X-"; j-f^^ persistence of 
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The absence of apparent 
the MSRV-X retrovirus. seropositive 
loaicri pathology recallxn. ^^^^^^^ .^rriers and 

tHaV in^i-^^ vLus after ino^unizing 

controls »ay infectious virus elation 

nave eliminated an ^^^^e an -^"^^f ,,ata 

, themselves or tn - ^^^^^^^ trtren^U^ent 

°^ '"Tat a pathogenic agent P^^^^^ ^^t^e cause of 
rr;ions - ^i.^ --^^rcUon o^tL population free 
disease i.piV -^^^ contact wit. such a 

- ^^%:r It - Ts "pathogenic 

-^^Trul : -;tutes all or part of thxs^ ^^^^^ 

---Ur-ragain.t components of 

15 type '■^^^ ,ntlbodl»s in 

retrovirus. a.teotion ot „£ieot t*. 

" cases t.s«a „ 

fact that epitope, tna against a 

20 itamunodoBinant MSRV i„„unxzat3.on 

strain variation .av^^ ^^^^ , or ^^^^^ 

divergent P^f ^'^^ and the treatments ^^^^ 

--^%"ovrtirtne antibodV -ponse aga.n 
modulate over ^ 
25 peptide. 
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TABLE No. 3 



10 



15 



Mean 

Std. Dev. 
Threshold 



CONTROLS 
0.101 
0.058 
0 . 126 
0.131 
0 .105 
0 .294 
0.116 
0.088 
0.1 05 
0.172 
0.137 
.223 
.08 
.073 
.132 
,129 
.06 
.31 



0, 

0 

0 

0 

0 

0 . 

t) 



MS 

0.136 
0.391 
0.37 

0 . 119 

0.267 

0.141 

0.102 

0.18 

0.411 

0.164 

0 .049 

0 .644 

0 .268 

0.065 

0.074 
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^ test for the presence of anti 
used to tesTi ^ above. ^ . 

antibodies in the same ^^^^^^^^ each seruxa tested 
Figure 42 shows the ^^^^ represents 
anti-igM antibody. Each vertic 
with an anti i.g» , or. at 492 nm) of a serum 

" ordinate axl. x,i„, to tha 

vertical bars. The f .ret » „i, 

,e£t Of t»e "--^^"^ controls (bloo-J donors, . 

^present the sera of of the vertical 

,0 and the vertical "-''f^^ cases of MS tested. 

The mean of tne 
tested is 1.6. values for the controls 

The mean of tne 

35 is 0.1' 



9 # 

o«ative controls 

OD values of 

ual to 0.7 ^ (3 X 0.6) ' 
ts tvence equal to 
^^e threshold value .on-specific 

«„its represent a ..yected 
.ne negative results^ .nti^odxes dxr 

...acxground.. J^/r^tope o. the pept-.^^ ^^^^^ 

Lcif ica^^^ lo this analyses, a- ^est 

the threshold. 6 ^^^^^ the «S 



course o „sRV-l. 
reactivation of « 
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TABLE NO. 4 



10 



15 



Mean 

Std. Dev. 
Threshola 

Value 



CONTROLS 

1.449 

0.371 

0.448 

0.456 

0.885 

2.235 

0.301 

0.138 

0.16 

1.0-73 

1.366 

0.283 

0.262 

0.585 

0.356 

0.-7 
0.6 



MS 

0.974 

6.117 

2.883 

1.945 

1.787 

0.273 

1.766 

0.668 

2.603 

0 .802 

0.245 

0.147 

2.441 

0.287 

0.589 



s a result of the new 
is possible. J developed by tne 

. made and the new »etho ^^^^^ation of 

discoveries made improved xmp ^^^^^^^^ion 

inventors, to P- infection an», 

the basis oi 

20 diagnostic tests ^ ,,d/or «^ ^^^^^ ^g,,,, 

ana to evaluate ^ the f ^ ^..^ermore. early 

its efficacy in « T^. ^^^^^gical fi^i*^^- ^ . „ neurological 
. the patient's bxolog displaying neu 

in the P .„/itviduals not ye^ xaaXe 
detection m -^^^^^^.^icl ^ .11 the 

„ .l,ns of MS or . „.,«ent *^'=^^%„.„^ clinical 

possible " ^^spect W the BUMeq 

Lre -"-'7 ,: rthat it vould precede 
course for the tact cl. ^ ^^^.^ 

,hich ,i„e. a „/iesion. has set 

'r^tre a .V-Pto-tolc^V J^J^^ the 

in. ah. hence nc ^^'^^^ e^^e- , 

'"•Tr. : reU »'--'=t;or " " 

3. iXr";-- 
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„d the present invention 
. of decisive importance, and the 
TovLes the means of ^-^^^f^^ ,,o. carrying out a 

It is thus P-"^^^'/\'^,/or reactivation, to 
diagnosis of in--"^ aW^ 

3 evaluate a ther^PV .n^-^^^ a.ents in 

;rrnr:.o.o.ica. f^ui.. 

. UKTERIALS AND METHODS 

Choroid Pl.x- .„in-=eU UM.rV. 

were obtained from » salpSWlSre, 
irl" Eeccurolles, »*P^"\tLe.l ceUs from 
"''""" ^anr;. Hon-tu„or.X -'^"^^^Ceri.ed (», - 
" "re Obtained as f"":"'%,,i,ntB used tor 

°°" herai """"ere obtained from the 

peripheral „ere 

"'"^"s are given in table 6. _iHcaticn 

MBplM are g isolation and purl viously 

-""°""u c.ll-«P« ««• """" 

" described (3, »)• ^ ..reened ^^1^'^. 

cultures were r ^ W-O"'"'-*'"^;^ used 

contamination with supernatant 

<"""'"rdeteotable nycoplaama. density 
" "^ra^riutrr vinon aescnbea P.J. 

nii\ From eacn . tubes, ^-^ 

V.d from the top of the ^^^^ fraction, 

collected fro ^ sterile tip to 

33 pipetman and a dxfje ^^^^^ 

60^1 we'^^ '^^^^ 
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. .fer containing AM guanidinium 

» free glycogen (Boenrx ^ oresence of l-^f*^ 

RNase-free g i «ater in the pree 

reoombinant ESase^ 

" l^rrseTanrriP-e ^ ^^^^ ana pol,- 

...activity was „act.on» 

sacroae gradients "' ^ .^^„o- BT-PCE with aege 
,nil& BVnthesis ana 
15 - ODSA SY .oe-io' at- «»» 

„gui«a in the "action =ont;i;lJ ^^^^^^^^ 

■ . The "Pan-retro , . the method oi 

vlrione. extracted „.ter. 

j„ perfon-ea on « ^^..^ivea in 20 .1 ^.^ 

rrr--on ----or™::-- OHase-i 

:.rjts <^--J-?:<^rtntain.g;.^--ro 

» Tis-HCi PH 7.., 0.. « 

nTT. 50 ToM ^.ix^itor: ( Pr omega ) . i-» 

^- -«4- RNase inhibitor units MoMLV 

"""ivatd at 80.= .or iO »in. 
then heat macti ^„ of ^ „ 

RT l^""""'' each tuhe in • „ min at 

" /rxreun: -a oO»* was sj^thes-d ^^^^ ^^^^^ 

.tiio^Ing ^ver-e Z .ouna i 

Tor s .in ;;;rpr::".:on, - ----Va:; 

f final volume 25 P of primers 

KCl 2.5 bM MgCl2- 200 ng treated with 

35 60 inM KCl, ^ ^ gach dNTP) was 

PAN-DI [see Figure 44], 
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t inactivated as above. 
0.3 units DNase-1 and ^^^J^ tm ^^elosure and the 

,.,,.cDH.wasadde xnt^eo^^^^^^^^^ 0.5 units .a, 

tubes heated to ^° /....^^^iiy to each tube ("hot 
polymerase (PerKin Elmer) ^-^^^'^^^I'l sS^C for 

imin, 34'Cfor 30 sec ^^^^^ ^ ^^^^^^ 

extension at 72 C o. M ^^.^e-treated PCR Bix 

transferred to the Round ^^.,3,, p^-ui 

(composition asjor -^^^^^^^ -t^^,, , PC. 

0 and PAN-DI) usxng the ^^.^^ ^^^,^3 ,„ly 

parameters were as for Round 

n«n «t 45»C for 1 min. 
and annealing at 4d v, 

- t:^:::^-^ cloned using the ..-clonin^^ 

, ,,t (B^tisr Biotechnology) according to 
manufacturer • s recommendations . 

. sequencing ^^^^tions were performed using the 

sequencing reactio ^^^^y^,,,i„ator cycle 

..prism ready reaction k Automatic sequence 

Biosystems, 373 A)- 

_ RT-PCR with STl primer sets directly from the 

The first PCR round .^-^^J^^^^ ,„,.3,ep RT-PCR 

25 cDNA reaction mixture accord-g 

technique described by "^^^^^J^;, J.^ty of airborne 
RT-PCR procedure reduced the P ,,,„,,erring PCR 

contamination when ^^^^^""'^ synthesis. RNA was 
reagents after an ^"^^^^^^^^/j^^ (snap-frozen in 

30 extracted as previously from 2ml o^P^ ^^^^ ^ 

liquid nitrogen RT-activity above 10^ dpm, 

sucrose fraction with a to ^^^^^ each RT- 

and resuspended in 50 M J i„cubated in a 

PCR, reaction lO^l of RNA s ^^^^ ^.^ 

35 PerKin-Elmer 480 ^^^^'^^J'^f ' ,oX DNASE buffer (50nM 
RNase-free DNASE 1 and 1.2 ^1 



100 

• lO/jAl Of Wase- 
0 1»M DTT) contaxnxng lU/^ 
^,3, 10^ Mgcl2 an^ 70OC for 10 

S^ 'Te s-uUon was P J 

^'""^ on^ with 88 .1 °^ each first round 

5 contamination) wi 4opM/tube of e 



10 



»0=100)). Each ««>e i»as tollo-ed 

"v - - ."'^/-^^ranl o». .o. ..^ 

maotlvat.on of ^ ,5-C for 1 ""^ .t 

„1„, 72-C for we" 15 

"VpJtix previously "-"^'^i^^^,, .».s. 
«cond round PC» ^ fo ^^^^ 

vl"" «"'"'r,„.c for " T of each second round 
inactive';-" -^^^ ^^3, "^"'"^.UUt^^^"*^*" '. 

%EQ ID ' Jot3. • o.ra,.eters 

30 Ugt^t tor analysis of 

- Hybridisation previously 
detection by BLOSA --"^^^'^'J^incatlons: .unc 

described (^D „ere coated ^^^^.^^ 

35 Maxisorb .xcrotitr P^^ ^^^^ 
veil capture probe cp 
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i-ion (21) or alternatively by using streptavidin 
Plates na^iotinylated CpVlb. Peroxidase- labelled 
coated plates an ^^^^ ^33^y 

™r"s e«.ac«a »o. .u»n MS p.a^a^ 
. ^an.a.» as aesc... -^--^ wrir 

reverse "ansorib ^^^ip^ reverse transcriptase 

''^""l^^CE, The resulting cPN.s were a^pUflea hy 

' "s«; ^tllough 35 cycle. (.4.C ^ Bin, =5'C 1 .n, 
15 semi-nestea PCR thro g ^.^^^ extension. 

72'C 1 min 30 sec) ana 72»c 8 min lo ,„n(ied 

7^ ^ i u^-^* . -reaion were ampliriw 

Three different fragments in the RT region 

by the following specific primers : 

- in the protease (PRT) region, for the 1st an 

.If PGR respectively, sense primer 

20 2nd round of pck. no : 103)] and 

nli^^^s'"' - - «^ ^ " 

:rr»o"rr/ - - - - - - - - 

,SBO ID HO:iOSn , «T region <«. n, 

" , the 1st ana 2nd round of PC, respectively, sense 

,6), for the 1st ana 2 cG« ACA G 3' (SEC ID 

pri»er (V A«G ACT ^'^J^ ^„ ^G TCT COC 

Trr»= / - .oc aac ..a 

- - CAA =C_3. XO »0:10S„ 

for the 1st ana /irt^r "T'-sr/o 



GCC TOT GCT CAC AGO AGA TTA GAT AC 3' (SEO ID 
primers 1= === ''^^ g^,. ^0 AGT GAG GA 3- (SEQ ID 

TO:109) / 5- AAA GGC * j^A GTT GCA 

H0:110)l and antisense primer 3 15 
CAAGTG CGCAGT C 3' (SEQ ID N0= 101) 1 • 
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xn. amplified fr.^e„« were analysed on 
ethidiu. brcide-stainea agarose gels, cloned in 
cloning veotor (Invitrojen) and s«^.no«J. 

2) KES01.TS . , ™,» u found in extracellular 

' :ir" «o."rpr.n.::ri:: ceu ou^ures and In KS 

— ■ ::;oid -j^>rtc=^^^^^ 

-Vrue to culture. expre.ein, 100- 
" 31, ,3,^i.«d wltn KT-activity similar 

120 nm viral partxoi ..oi-te (3)- similar cell- 

- -at o. -y^^^rVp" uc-'„eitJr't.ie ^.-activity 
..pes fro. "O'-'^J^'^l^ culture, were poorly 

nor virions. Ml ^ a li-ited 



c— .iently productive and, r . a a l^lt^ea 

U.eepan ..e^-re in .rde^ — e.ctracellular 

present - « J, ,pp„ach to amplify, with 

virions we «»d „„„3.„,a „,ion of the pol 9ene 

ae,enerate primer».J ^^.^^^^ „chni,ues 

,0 present ^. „ii,a "Pan-retro" RT-PCR. 

He t::lent of ..mples and r.„ent. was 

Tal Toause human D». contains many endo,.nous 
essential, beoaus technique. 

:;ti:."a .L performed on sucrose- 
« "Pan-retro ^Jf* J,^^ tr.. supernatants of 

aensity f and their non-infected 

alfferent "J^;;;^^,,^, „mplea post-mortem 

controls: (1) chor p ^^^^ ^^^^^ 

£rom MS brain (PI-I 1) . < > irraai.ted 
30 MS brain autopsy, infectea by non-inteotea 
cells obtamea by 

=--^''-r\n vitro transformation of two EBV- 
ro::r:e individuals, ...one MS ^.nt a. ,v, ^one 

" -r.-^rJivT ro^Ltnt 'fractions obtained 
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. ™ the B-cell cultures. The technique described by Shih 
from the B cell gemi-nested RT-PCR protocol 

-Kfc =1 /12^ was modifiea in a semx 

et al. (12) was extensive DNase 

using degenerate primers (Fig. 2) and exnen 
! tZt pel amplifications were performed in London 
!n Of ViroTogy, U.C.L.M.S.) on coded aliguots of the 
' r -tv ladierT fractions. Blind and systematic cloning 
density ^^^^^l/l „ere undertaken in an 

and sequencing of the pck p complete 

rtr^r. ..... .... 

vxr.1 ^'-^ corresponding RT-activity 
" '*'^:re%rrtions o, „„.„.ectea cui.ures. Xhe predominant 
™ detecrd in bands ot the expected ei.e tll« bp, 
iriified in ail tne .T-.ctivity positive fractions <but 
"t in L RT-activity negative fractions, was different 
v„^ retroviruses and was designated MSRV-cpol. 
from icnown ^^^^^ ^^1 homology (70-75%, 

rZ'Z:, rreviouet; :::=riLd endogenous retroviral 
?r 18, A lew ERV9 sequences (>90% homology with 

Triirrs l^d iow-temperature annealing, were found in all 

samples (Table 5, . consensus sequence of 

,0 Pigure .4 shows an -^--^^^^ ^^^^ ^ cf 

rertetr^tirres ngu^e » displays a phylo^enic tree 
blsed on the evolutionarily oonserved amino acid .e^ences 
: toth exogenous and endogenous /^J" 
this tree it can be seen that the poi gene 
35 region. From tnis tree 



t 
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MSRV " 



is phylcenlcally rel«ed to the c-type ,r=up =£ 

5 with MS. Identirio laborious and tu.e 

consumin,. «e , ,ap,„„ probe (CpVlB, 

,0 rapid --"---"j:,:~,;"cificityo, 
retrovirus' PCR products (Figure 4i) ™,gRv-eool was 

this sandwich -V^"^'-"°"-''"f 'nBIV aT Mo^, and 
tested With hoth distantly related 

Closely related (ERV„ pcl -^""'^^^^ -^^terved despite the 
X, reactivity with such ^^^^"^^ T^Z\s 0.01 ng of 
ability of the ELOSA to detect as little 

HSRV-cpol CNA. ^^^^ ^^^^^^^^^ ,0 

cerebrospinal fluid ( ^^^^ 

patients J= ^^^^^ „as extracted from CSF 

pellets, reverse tr „„aucts revealed MSRV-cpol 

analyBis (Table 6) of "•«J'=« ^ut In none of 

4« R the 10 MS patient sampxeh. yj^^ 
sequences in 5 of the J neurological 

the 10 samples from patients wit 
„ diseases ,P<0.0.,. The presence °' - 

^^n'ura^r^trtsTn'ly HO P--nt With 

was seen in untreated P ^^^.^^ ^^^^^ 

t^rirbren^rXpcl detection and CSF cell count 
30 was observed. p^l gene 

,eno..c se.ue.ce ^ ^^^^^^^^^^^^ 

liters of LM7-xnfected s ^^^^^^^ ^^^^^^ 

cells. A limited number of clones wa 



35 

cells. 
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which (PSJ17) showed partial 
Cloning of the cDNA, one of wh.c. ( ^^^^ 

homology with Jf;:;:, a«pUfied a 740 

--^^^^rrKin: t^e two independent sequences in HK. 
bp fragraent ^--^"^J pgji, „as localised on 

5 extracted fro» purifxe.^^^^ ^^^^^^^^ ^„ 

the 3- side of MSRV cpol. ^^^^^^ 

^■he 5- side of MSRV-cpol and on the 3 si 

the a Sail's from purxfied lm/ 

---.-fLrcS TmT lo^oTd^Lus cuUes <.) . 
like vxrxons produced nucleotide sequence 

^° ^" ^^^''''^ i«„nina clones obtained by sequence 

'T' liner xn «>e Cer^inal re,ion of the RT se^ence, 
underlined. In ttte ^^j^^es reaularly present in 

the dispersed a,»rnc add resld ' 

retroviral ».ase H domains, ™. in 

20 - „on-degener.te P'";"^" / Rj-activity . and in 

virions ..sooiated with the peeK oi 

'r"T('sTr"Trir.er set, positions e03-«S/l,»- 
POE primers (STX^l P i„ the pol 

1,14. on ri,.4, based 

" ^i^^lates ohtamed from different MS 

several different positions 869-889/1513- 

patients. Hested P"-" J^J';^; ,„^,„t (Figure 47, 
on ^^;;^^:J::J -Hidlu. hro»ide 

„hi=h /"^^^^ product generated by STl.l. 

,0 ^^.fp^oducts „as oonfir.ea by stringent 

rbrlat" il . Uidase-labeled MSKV-cpo. probe 

•--"'--"VnTrtmrtrr-used to detect 
The STl.l and 2 primer _,asma, although non- 

I Mcnv RKA in human piasma, 
" :::iri r thrUication. n^. 4, iUustrates the 
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It. of PCR ampli£ioation of cO»A derived from 2 MS 

I ^LT control Pla.B» .-Ple= 
patient and 2 coniiroi „^=dient fractions of 

r»«* *v«Tn the sucrose density gradient irao 
with CDNA from the ^aq-sequencing of the 700 bp 

Mq Choroid plexus isolate, laq ^ h 
Ld": - presence of MSKV — 

Jte .1 ".ere ..e,..ed 

corresponds to tne °° rt-PCR for cellular 

i«= usually sediment. Control RT pi-k 

rrreJitu on .-»--rdrj::.rr - 

v»,=.4- vhe results were not due to cexxuj. 
indicating that the resux^i ^„„;,^ation It should 

, ««H hv cell lysis during plasma separation, 
released by cell ly ^^^.^^^^ 

be noted that this sensitivity is impaired 

epidemiological studies since its sensitiv 
by the length of the cDNA required ^^''^ ^^^^ 

Non degenerate primers amplifying 

! vv,« »ol aene (the whole protease region, 
fragments of the pol gene K^n 

. « the reverse transcriptase, Cf. ng- ^ 
regions A and B of ^^^^ J^^^^^ presence of MSRV sequences 
were also used to confirm the P^--;; ^.^e 
in ONase-treated RNA from MS J'^f J,,„,3 ,,,b 

- rtrire.^^::-a:ai;sr^^^^^ 

were homologous (>95% an<» 
RT ^"^^°"\3;;"3,^ences previously obtained on 
respectively) to MSRV were detected in plasma 

culture virion. No such s q parallel with MS 

from healthy controls (n=4) , tested m p 



10 



15 



25 

plasma . 

3) DISCUSSION 

. Phylogeny Of MSRV ^^^^^^ 

From the resui.T:s o ^ , 



30 



35 



From the resui^^. ^ ^.^ "LM7" 

. ^ ^h«t the virus previously referred to as LM7 

se;uences ^^r^^^^^^^^^^^ ^^^^ ,,,, ,3 two amino 

The conserved RT motif retroviruses, apart from 

acids shorter than that of ^ functional 

,.4v.,iee«! which nonetheless navts 
human foamy viruses wnic prt-RT 
RT. The potential ORF encompassing the 
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. ^ +. tji^h the virion-associated RT- 

r/r.;u:rn„ "icr 

J c...e J— '^^^^^^^^^^^ ^orn 

recently ^"'^^^^'''^^^'^^Zl clone, frc MS plas«a, 
„e can confira the reality o sequences 

rr/nirrtror^^^^^ 

u^rt.K,n to confirm their ahility to encode en.y»e. ana 
.0 structural protein, o, -.V ;iri<.ne 

The phylo.en.= tree .n ; retroviruBee 

conserved amino acia ^^^^ 

ZZ Jg YXDD) , Shows that the MSRV pol gene xs related to 

(VLPQG. . .^xuu; , -closest known 



/-iTT-Dor YXDD) . shows tnat uiic i^w*^. r- 

( VLPQG. . .Yxuuj , closest known 

the C-type oncoviruses. Apart from ERV9, tne c 
the C typ RTLV-H, a human endogenous sequence 

15 retroviral element is Rii^v n, 

r i.r> have a subtype with a functional pol gene (32). 
rrU rio:. t^le pnylo^emc a^matlon to C typ^ 

annarently contradicts our previous 
oncoviruses apparem-xy 

assumptions based on the general 
a„ particles observed by electron microscopy (EM, whrch were 

= B or D-tvpe oncovirus (3, 5, 2t>) . 
compatible wi«. a J /.L on env sequences detected in 
However, preliminary aa phylogenic proximity 

MSRV virions, would suggest a greater phyi g J' 

" trrcririrr :y'':.;r"a 

" rooXrtorTaTortin in .-type — iru... .3,^ 0 to c 

the protein can convert retrovirus .orpholcy to that 

" ",:Lrt*" ~s retrovirus sharin, extensive 

- ^^ . „i,ted endogenous retrovirus family or an 

T I^u^lrov ruTp-oducin, extracellular virion., 

sll::: blot analysis .ith an MSKV pol probe 

3. under stringent -^^^--^--^rT'^jT..: 
multicopy endogenous family (dat 
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. ^. existence of endogenous elexaents more 

llnlifloltlon of HSRV-rslated endogenous sequences. Since 
Z L lTuou. conserved retroviral gene, the se<r>.nce 
Ascribed here is tne Xeest citable region to 
discriminate between exogenous and endogenous se<p.ences. 
10 xtTs hoped that seguence information from other parts of 
10 It IS n p .„.h a discrimination, would it be 

the genome may permit ^ demonstrated for 

on a tiny portion as has recently ^^^^ 
the Jaagsiekte retrovirus (JSRV) ot sne p i , ,^„,,,„ 
J data it would then become possible to identify 
r TKV-t-f^ provirus in the genome of virion- 

Z7lZrX~ virion-producin, exogenous member 

MSRV couxa 4^^«^iv lUSt as exogenous 

^ Tn?vQ-Tike endogenous family / D^s^ ^ 

::r.rns"in the -;:;7-;:rrri 

3„ Virus <H«XV, and ^rdirTfamily 

families of mice, and also, in un 

ortheep OS). Alternatively, it is ^^^J^^^^:^'^ ^^'l 
the extracellular MSRV virions may be P^^"^^^^^^ 
replication-ccpetent endogenous P--^^* . J'^^.^^ 
25 exogenous or endogenous, conceptual "-^-i^^^ ^^^^^ 
with the category of retroviruses represented by MuLV 
^ and JSRV. unlike defective endogenous elements, thxs 
Tego^y Of agents are Known to produce infectious and 
category J* neurological disease (37), 

pathogenic virions, to c 33) and to express 

30 solid tumours / leukaemias (36, 38) ana / 
30 soj-xu aa 40). Furthermore, m MuLV 

:::=: r^rreni::;noui -ovirai ba„d 

resistance to disea.e (3 , endogenous 
between an mrecrious l^^*- . mc 

co^terpart may be relevant in the pathogenesis of MS, 
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..„ce e„ao,e„cu. ..«ov.... " ^ Ttt'ti^ 

,U individuals. A g.n.tio "'^'l^.Zlr. con^ri>^t. 
endogenous retroviral ^'^^^ ^^'^s (43) , if MSRV 
tc the known hereditary eusoeptibility to MS (43), 

^ r^rU" -"^v, e^e^ent. 

„.ewhere the data .n ,„„,.activation in 

«»y be oo-expressea, p heterologous BMA 

infected cells, ^^^^^"^....^i^o^s pacXagln, is 

pac.a,in, in »-\™^;/„:f„r3,,, J ,4.,. 

10 known to occur in other J previously evoked 

. i role tor the nunerous common viruses pr 

' A^on, the numerous reports o. -.uses putatively 

involved in the ""--'--J"; "^'.^uies, the 
1. proportion focus on^ ^/^.^....aae. Regarding the 
p.ra.y.ovir.dae a^d the ^J^^^^^^ . 

paramyxoviridae, ^^"^ i„ „S patients 

increased -"*^'^^""\;%:r against measles fusion 
essentially directed in C3.,^^ -i-"" similarities 

20 protexn (44). The ex proteins of 

^.ween — ^^^^^^ "Lansmemhrane protein of 
paramyxovxridae and t ^^^^ ^^^ervation if 

retroviruses (45) , nay = [ protein, 
antigenic cross-reactivity between these 

25 occured. the involvement of 

With regard to the ^fex virus type 1 

^.«i„-Barr ^^^^J^^^Vy, uLn Herpes Virus . (HHV-e) 

30 end from those of ^ther^ 

herpesviruses »ay pi that HSV-i immediate-early XCPO 
expression, we hav „SRV/l«7 in vitro (6) 

and ICP4 proteins can transacti ^^^^^^^^ 

end Haahr et " , „o-£actor in MS, 
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description by challoner et .1. (47) showing .ignxficant 
."ression Of BHV6 proteins in MS places .ay also suggest 
a sli»ilar role tor HHV6 in the brain. 

5 BOHPLB 1. . HSRV GEI»H1= DETBCTIOH TBCHKIS.E 

Following 0.4 ;.m filtration to remove cellular 
aebris ana B«ase digestion to remove residual n^- 
encapsidatea m^. serum was processed to extract vxral EHA 
by .eans of .asorption to a silica matrix. Vxral BKX was 
10 sub'ctea to DNase aigestion, then a combined reverse 
t«nscrlption-PCR (RT-PCR, reaction was performea usxng 
primers PTpol-A (sense, Vxxxx,., SEQIDH0;18„ ana 
l^ol-F (antisense: s^xxxxV, SEQ ID »0=184, . A second 
^La Of amplification with nestea primers ™P»^- '--^ 
15 5.XXXX3', SEQ ID BO=185) ana PTpol-E (antlsense: 5'xxxx3 , 
" S^^^ ^-""^ ^ product Which was 

identified by g.l electrophoresis. The specifxcxty of each 
produce was confirmed by aiaeoxy sequencing, control 
reactions without reverse transcriptase were Pe^'orm-a to 
20 ensure that the proaucts were aerlvea from vxral RHA. In 
t^altion, to exclude the possibility that the extractea 
:!rat BiIa might be contaminated with host cell aerxvea 
nucleic acias, ali,uots were tested by nested PCR tor the 
presence of pyruvate dehydrogenase (PDH, ™* J"*^ 
„ samples which generated a signal in either the PDH or the 
"no-RT" PCR assays were excluaea from the analysis. 

sera from patients with clinically active HS ana 
controls were ampUtiea by RT-PCE and seguencea virion 
associated MSRV-RHA was detected in the serum of 10 of 19 
30 : ; patients with MS but in only 3 of - --^^ 
without MS (P.O. 0001). The control group consisted of 8 
laments (all MSRV-RMA negative) with rheumatologxcal 
dtsorrers Ind ,e healthy adults. MSRV-RHA titres in both 
MS Tatlents and controls were apparently low ™ 
,5 moaerate dilution of sera «10 tola) caused loss of 

signal. 
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„ „s patients, d.t.=tion of HSRV-«.» was not 
. sex disease duration, or MS type, 

associated '^^^^ correlation with treatBent 

However a -"^J^^^^^,, „,.ai„ea frc» the i, 

""ti:::s 100* T the \era .ro„ untreated patients 
patients , lo» „_Rv-!aiR whereas it was detectable in 
r/roritr^^^-rlained dunn, tre.t„ent with 

.orLcosteroids -/-^--^"nt loVs o. virion 
The reason for t.ne cifp^j. ^4, 

_ciated — .—nr::; r;r:iue 

r:::trl o: 'r detection HS.V m cerebrospinal 
fluid. 

TABLE 7 

O.XECTXO« or VIBXOH .SSO«»ZD MSKV-PN. X» MS ^TED 

PATIENTS & CONTROLS 




20 



25 



a,^ control group consisted of 8 patients with 
a The control gr P .a„its. 
miscellaneous non-MS disorders an „„trols in 

b The detection ct MSRV RHMn P^^^' f ^^'^"^^ ,„„,,,,ent 
conditions which select P-"''''^^^-^^^^ 

:;:='. lied? such -rri-.orsrjirairre 

carriers or be in the pre-clinical (or 
of the disease which can last for years. 
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• . -^-H filtration and RNase 

- Modified SNAP KNA extraction wxth filtration 

digestion at roo^ temperature) 

ip to 500 .icroUtres of serum is filtered using 
. Ln filters (Nanosep MF from Flowgen 

:;::io:ur::. -01.:^^. OOM.5). serum is spun for 

catalogue ^^^^^^^ ^.^ necessary) . 

^ " ' mLolitres of filtered serum is incubated 
, „ith 10 units RNase One (Promega Catalogue NO.M4Z61) for 

150 microlitres was then extracted using the 
SNAP KNA extraction Kit (Invitro.e ) - 

- 10 micrograms of poly A kwa wa 
„o ™i=.oU«es Binain, -«.r « a car. er . 

flrfri&d to the serum and mixea oy 

^ SNAP column and spun ijuu y 

ana t^e f o:-rrou,h aiscaraaa ,• the remainder 
20 1 min and the tiow ^ 

then aaaea t.. SK*P col-™ a a p„„ a. .JOOJ ^^^^ 
ana the flow-through aiacaraea , ^ 

.ashea «ith 600 .icrolltrea of ^-'J^^/^I.^^o 

, J . +-v**a riolumn was then wasneu wx^*i 
through alacaraea < fx„„-.hrou,h 
25 microlitres of He KM* " 

alacaraea ; this .ash was repeated w th a 
•« ar,r! the flow-through discarded , tne 
7Z was ttn elutea h. incuhatin, with 135 .icrolitres of 
aciu wa= _^„_ a*. o for 1 mm. 

t,M«oo free water for 5 min and spun at 130u g i 
B»ase free w. ^^^^^^^^ ^„ff„ and 3 

= ,30 units) of D»ase I, RNase free (Boehringer 
micrcl.tres (30 units) i„,ubated for 30 

-■^f 3 Tso JlcroxLes of Binding Buffer was added 
Ta t:ea\; inversion . times , 300 microlitres of 



35 



and mixed .y ^-—J-^^ .^.^d by inversion 10 

propan-2-ol was then ^^^'\ 2sf.rreA to the SNAP column 
times ; 500 microlitres was transferred t 
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and spun at 1300 g for ^ _ ^^^^ 

coin was ...n -.shea «i» eoo 
dlsoarded , ">« tiow-through discarded ; 

,.is wash was repeated ' J,,,/ Jd was then 

rr;;" crtC^it-or ..croX.tres «e. 
:ater .or . and spun at 1300 , for 1 Bin. 

- Titan RT-PCR Titan one tube RT- 

RT-PCR was performed usxng J^^^" ^ 

fBoehringer Mannheim cat. No. 1 855 476) 2 
PCR system (BoeUr^ng _^ ^^^^.^^^ ^^.p,^ 

"'°^:"on" The toral reaction volume was 50 microlitres 
i reaction. Tne i^oi-ax pvrase inhibitor used. 

/in iinii-fi\ was the RWase xim^^ 

™- SB0lONO:i83 and SBaXDNO:184 

170 ng of ^^-^-^^ J I ^,3^,, was prepared 

respectively, were used A sx g ,,^ed at room 

and the sample RNA added last. This w 
0 temperature, not on ice. sequential 30 min 

The RT step consisted of two s q ,„„^diately 
^ 60»C. This was immediately 

_ sinoc and then ou v». 

rreH: :i r» wh.ch .d -rn., 

* ---i-rr"r orc^s • .or 30 seconds , 
j5 * 40 cycles of 94 £or 

68 "C for 45 seconds, 

* 1 cycle of 68OC for 7 min. p,,,,^ed using the 

The second round PCR was pej. 

..pana te„pl.te - -^^'r ^^.^ X wa " a^fed 

30 »o. xes. »..,^ -j-^:-:::: rrr/.u«er «». . and 

"/r::r::« B .nd\ were usea. .he PCK had the 

r"°j;tle;'on.-c .or 30 seconds, so-c .or 30 seconds.. 
35 68»C for 45 seconds, 

* 1 cycle of 68«C for 7 min. 
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1 min and the flow-through 
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The PGR products were then run on a 2% agarose 

The no RT controls were performed using "Expand" 
PCR system for both rounds. The first round was 40 cycles 
and the second round 20 cycles. 

AS a positive control a DNA dilution serxes was 
used in both the RT-PCR and the "no RT" PCR. For a result 
To be valid the RT-PCK and "no-RT" PCRs had to have 
detected DNA equivalent to between 1 and 0.1 

The analysis of PCR products of an approxxmately 
435 bp fragment in the pol region is shown in Table 8. 

TABLE 8 

ANALYSIS OF PCR PRODUCTS WITH ORF * 



15 



20 




25 



* Defective RNA 
virions , since 



can 
the 



also be present in circulating 
fidelity of the MSRV reverse 
virions, Since //"^^'^'^^ 3ince recombination 

then obvious that the in illustrated in this 

•«v,<H+-v can be greater than that illusT:raT-e 
variability can oe y . ^^^^^ ,iof active MSRV RNA 

example, because of these encapsidated defective 

copies. 

Table 9 which data have been determined from the 

^ ^ x?^r,ur-&a 49 to 53, shows a variability : 
35 alignments of Figures 4» to a , 



30 
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- .atween the clones obtained fron the seB. p.ti.nt pl..B. 
sample In the same PCR amplification experiment , this 

that the patient possesses a virion population which 

comprises dl«.r.nt MSRV variant. ='\V^,"'V';""' 

- between the sequenced variant populations from '^^"-""^ 
patient. , this means that the variants differ from a 
patient to another patient. 

TABLE 9 

Degree of identity (percentage) between nucleotide 
sequences and between peptide sequences, 
by direct coxnparison of said sequences (see Figures 49-53) 



Patient 

Nucleotide 
sequences 



Peptide 
sequences 



between SEQ ID NO: 169 
and MSRV-pol (SEQ ID N0:1) 
90,4 % ^ 

92,3 % * 

SEQ ID NOs:170, 171, 
172 between them 
98,6 % ^ 




SEQ ID NOs:173, 174, 175 
between them 

97 % 



between SEQ ID NO: 176 
and MSRV-pol (SEQ ID N0:1) 
82,5 % ^ 

84 % ^ 

SEQ ID NOs:177, 178, 
179 between them 
94,5 % ^ 

95,1 %^ 

between SEQ ID NOs:180, 
181, 182 and SEQ ID NO: 
73,5 % 

SEQ ID N0s:180, 181, 182 
between them 

89 % 



1. a, this percentage is determined on the basis of sequences 

including the primers. 

o «-.A and 53B, the variability between tested 
20 From Figures 53A and sjb, 

patients sequences can be determined : 



t # 
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. between SEQ ID NO:169 and SEQ ID NO:176 : 16,5 %a and 
the peptide sequences obtained froB 
SEQ ID no: 169 and SEQ ID NO: 176 : 20 %. 

pour xnicroorganisxas are mentioned ^^^^ 
1. 2 lines 15-26 and they are identified 

rccl^aal With the provlBicns o, the Bu..pest T...t,. 

^° •4. A 9?nd JulV 1992 under No. 92072201, 

- LM7PC deposited on 22nd July i^^* c -,0108 17 

.u ^ ^„ tn-h Tanuarv 1993 under No. 9301081/, 

- PLI-2 deposited on 8th January vq2072202 

^'.^A Tiiiw 199 2 under No. V9zu/^^u^» 
_ POL-2 deposited on 22nd July 199^ 

^""^ •4. «n «th January 1993 under No. V93010816. 

15 - MS7PG deposited on 8th January 

* ECACC : European Collection of Anixnal Cell Cultures 
vaccine Research and Production Laboratory 
public Health Laboratory Service 
20 centre of Applied Microbiology and Research 

Porton Down 

Salisbury, Wiltshire SP4 OJG 
United Kingdom 
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(ix) T£LECOMHUNICATI0N INFORMATION: 

(A) TELEPHONE: 4 72 69 84 30 

(B) TELEFAX: 4 72 69 84 31 

5 (2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleotide 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 





CCCTTTGCCA 


CTACATCAAT 


TTTAGGAGTA 


AGGAAACCCA 


ACGGACAGTG 


GAGGTTAGTG 


60 




CAAGAACTCA 


GGATTATCAA 


TGAGGCTGTT 


GTTCCTCTAT 


ACCCAGCTGT 


ACCTAACCCT 


120 




TATACAGTGC 


TTTCCCAAAT 


ACCAGAGGAA 


GCAGAGTGGT 


TTACAGTCCT 


GGACCTTAAG 


180 


20 


GATGCCTTTT 


TCTGCATCCC 


TGTACGTCCT 


GACTCTCAAT 


TCTTGTTTGC 


CTTTGAAGAT 


240 




CCTTTGAACC 


CAACGTCTCA 


ACTCACCTGG 


ACTGTTTTAC 


CCCAAGGGTT 


CAGGGATAGC 


300 




CCCCATCTAT 


TTGGCCAGGC 


ATTAGCCCAA 


GACTTGAGTC 


AATTCTCATA 


CCTGGACACT 


360 




CTTGTCCTTC 


AGTACATGGA 


TGATTTACTT 


TTAGTCGCCC 


GTTCAGAAAC 


CTTGTGCCAT 


420 




CAAGCCACCC 


AAGAACTCTT 


AACTTTCCTC 


ACTACCTGTG 


GCTACAAGGT 


TTCCAAACCA 


480 


25 


AAGGCTCGGC 


TCTGCTCACA 


GGAGATTAGA 


TACTNAGGGC 


TAAAATTATC 


CAAAGGCACC 


540 




AGGGCCCTCA 


GTGAGGAACG 


TATCCAGCCT 


ATACTGGCTT 


ATCCTCATCC 


CTU^CCCTA 


600 




AAGCAACTAA 


GAGGGTTCCT 


TGGCATAACA 


GGTTTCTGCC 


GAAAACAGAT 


TCCCA6GTAC 


660 




ASCCCAATAG 


CCA6ACCATT 


ATATACACTA 


ATTANGGAAA 


CTCAGAAAGC 


CAATACCTAT 


720 




TTAGTAAGAT 


GGACACCTAC 


AGAAGTGGCT 


TTCCAGGCCC 


TAAAGAAGGC 


CCTAACCCAA 


780 


30 


GCCCCAGTGT 


TCAGCTTGCC 


AACAGGGCAA 


GATTTTTCTT 


TATATGCCAC 


AGAAAAAACA 


840 




GGAATAGCTC 


TAGGAGTCCT 


TACGCAGGTC 


TCAGGGATGA 


GCTTGCAACC 


CGTGGTATAC 


900 




CTGAGTAAGG 


AAATTGATGT 


AGTGGCAAAG 


GGTTGGCCTC 


ATNGTTTATG 


GGTAATGGNG 


960 




GCAGTAGCAG 


TCTNA6TATC 


TGAAGCAGTT 


AAAATAATAC 


AGGGAAGAGA 


TCTTNCTGTG 


1020 




TGGACATCTC 


ATGAT6TGAA 


CGGCATACTC 


ACTGCTAAAG 


GA6ACTTGTG 


GTTGTCAGAC 


1080 


35 


AACCATTTAC 


TTAANTATCA 


GGCTCTATTA 


CTTGAAGAGC 


CAGTGCTGNG 


ACTGC6CACT 


1140 




TGTGCAACTC 


TTAAACCC 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(t) SEQUENCE CHARACTERISTICS: 
g (A) LENGTH: 297 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



15 



^.a. CTACATCAAT TTTAGGAGTA AGGAAACCCA ACGGACAGTG GAGGTTAGTG 60 

CCCTTTGCCA CT^«^^^^^ ,,,,,cTGTT GTTCCTCTAT ACCCAGCTGT ACCTAACCCT 120 

::rrc~TA:=OCAGAGTGGTTTACAGTCCTGGACCTTAAG ISO 

— TCTGCATCCC TGTACGTCCT CACTCTCAAT ~ C^^^^^^^ - 
CCTTTGAACC CAACGTCTCA ACTCACCTGG ACTGTTTTAC CCCAAGGGTT CAAGGG 



20 



(2) INFORMATION FOR SEQ ID HO: 



3; 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

-,0 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

" J:1J«.ccc»>tc ™c. ~cc occct^ - 

AGGTCCAGSN ACTCTGTYCC TTCAG 



35 



9 0 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 86 base pairs 
5 (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOG^f: linear 

(ii) MOLECULE TYPE: cDNA 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GTTCAGGGAT AGCCCCCATC TATTTGGCCA GGCACTAGCT CAATACTTGA GCCAGTTCTC 60 

86 

ATACCTGGAC AYTCTYGTCC TTCGGT 

15 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 85 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
OT,«»O.T .OCCCCCTC T»TTXaOCC» .^^.T^CC C«O.CTTC» O.C«TTCTC .0 



30 ATACCTGGAC ACTCTTGTCC TTWG 

(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 



# 
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(B) TYPE: nucleotide 

(C) STRANDEDNESS5 single 

(D) TOPOLOGY: linear 



5 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6s 

aTTC«»OAT .CCTCCO.TC TKTTTCOCCT =GC»IT»CC CX=»C»CTT« «»>«TCT. 60 

10 ATACGTGGAC ACTCTTGTCC TTTGG 



(2) INFORMATION FOR SEQ ID NO: 7: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



25 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CTOTTGCCAC AOGGGTTTAK ROATANCYCY CATCTMTTTG GYCWRGYAYT RRCYCRAKAY 60 
YTRRGYCAVT TCTYAKRYSV RGSNAYTCTB KYCCTTYRGT ACATGGATGA C 



30 (2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 645 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



# # 
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(ii) MOLECtJLE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 8: 

' ... CCCCCIVTCTA TTTOGCCAGG CATTAOCCCA AGACTTGAGT CAATTCTCAT 60 

TCAGOGATAG CCCCCATCTA TTT ^^__„cT TTTAGTCGCC C6TTCAGAAA 120 

™aC .c™ — ™- ,30 
CCTTGTGCCA TCAAGCCACC CAAGAACTCT TAAC ^^^^^^^ 

^™rcrG:rcrArGrc:™ - 

10 CCAAAGGCAC CAGGGCCCTC ,,,,cATAAC AGGTTTCTGC CGAAAACAGA 360 

CCAAAACCCT AAAGCAACXA AOAGO^^^^^^^^^^ ^^^^^^ ^^^^^^^^ 

TTCCCAGGTA CASCCCAATA GCCAGACCAT TATATA cTAAAGAAGG 480 

CCAATACCTA TXTAOTAAGA TGGACACCTA CAGAAGTGGC XXTC 

^^^r^/^nrTr TTCAGCTTGC CAACAGGGCA AGATTTTTCi 

CCGTGGTATA CCTGAGTAAG GAAATTGATG TAGTGGCAAA GGGTT 



15 



20 



25 



(2) INFORMATION FOB SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



30 



35 



..ri^CTCTT AAATTTCCTC ACTACCTGTG GCTACAAGGT TTCCAAACCA 60 
CAAGCCACCC AAGAACTCT. ^TT ^^^^^^^ ^^^^^^^^ 

^GGCTCAGC TCTGCTCACA GGAGATTAGA ^ ^.^CTCATCC CAAAACCCTA 180 

.OGGGCCTCA GTGAGOAACG TATCCAG- L^cAAG ATTCCCAGGT 240 

-™ CAGGGTTCCT T^CA^ ™ ^ _^ 

= = — — 
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CAKGCCCCAG TGTTCAGCTT GCCAACAGGG CAAGATTTTT CTTTATATGG CACAGAAAAA 420 

ACAGGAATCG CTCTA6GAGT CCTTACACAG GTCCGAGGGA TGAGCTTGCA ACCCGTGGCA 480 

TACCTGAATA AGGAAATTGA TGTAGTGGCA AAGGGTTGGC CTCATNGTTT ATGGGTAATG 540 

GNGGCAGTAG CAGTCTNAGT ATCTGAAGCA GTTAAAATAA TACAGGGAAG AGATCTTNCT 600 

GTGTGGACAT CTCATGATGT GAACGGCATA CTCACTGCTA AAGGAGACTT GTGGTTGTCA 660 

GACAACCATT TACTTAANTA TCAGGCTCTA TTACTTGAAG AGCCAGTGCT GNGACTGCGC 720 

741 

ACTTGTGCAA CTCTTAAACC C 



10 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TGGAAAGTGT TGCCACAGGG CGCTGAAGCC TATCGCGTGC AGTTGCCGGA TGCCGCCTAT 60 
AGCCTCTACA TGGATGACAT CCTGCTGGCC TCC 



25 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 
2Q (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



9 



# 
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.CTCCCC. CCCCOCTO^ CCCX.TC.CO .OC.OTTOCC CC.XOCCCCC SO 
TATAOCCTCT ACGTGGATOA CCTSCTOAAO CTTOAO 



5 

(2) INFORMATION FOR SBQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 12: 

r... ACCGCTTGCT GGATGTAGGC CTCAGTACCG GNGTGCCCCG CGCGCTOTAG 
"'"Tag Agreed GAAACACGCG GGACCAATGC GTCGCCAOCT TGCGCGCCAG 
TTCGAT6TAG AAAGCGCCCG G~Mi ccCATGGCGC CGGAGAGCGC 

0 eOC^CO^TO ™» OCOC..O0CC - 

"-^-^^ ^ ZZZ. — 

:= = ~ — r= = 
— := = := cc^c 

-3-=^ r.cco. cc^ccccoc 

:rrc r.... — " .00=0.0. 

30 AAGCCTATCG CGTGCAGTTG CCGGATGC 

(2) INFORMATION FOR SEQ ID NO: 13: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 



t # 
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(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GCATCCGGCA ACTGCACG 

10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GTAGTTCGAT GTAGAAAGCG 



25 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 
2Q (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: cDNA 

35 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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GCATCCGGCA ACTGCACG 



5 (2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



15 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AGGAGTAAGG AAACCCAACG GAC 

20 (2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



30 



(il) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TAAGAGTTGC ACAAGTGCG 

35 (2) INFORMATION FOR SEQ ID NO: 18: 



18 



23 



19 



9 
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15 



20 
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(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

10 21 
TCAGGGATAG CCCCCATCTA T 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

25 24 
JUVCCCTTTGC CACTACATCA ATTT 



(2) INFORMATION FOR SEQ ID NO: 20: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 
(ix) FEATURES: 



(B) LOCATION: 5, 7, 10, 13 

(D) OTHER INFORMATION: G represents inosine (i) 



5 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GGTCGTGCCG CAGGG 

10 

(2) INFORMATION FOR SEQ ID NO: 21: 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 (ii") MOLECULE TYPE: cDNA 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TTAGGGATAG CCCTCATCTC T 

25 

(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

30 



(A) LENGTH: 21 base pairs 



(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 



21 



35 



ii) MOLECULE TYPE: cDNA 
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(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TCAGGGATAG CCCCCATCTA T 

5 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE; nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



15 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
AACCCTTTGC CACTACATCA ATTT 

20 

(2) INFORMATION FOR SEQ ID NO: 24: 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 (il) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GCGTAAGGAC TCCTAGAGCT ATT 

35 
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21 



24 



23 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCATCCATGT ACCGAAGG 



15 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(li) MOLECULE TYPE: cDNA 



20 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
ATGGGGTTCC CAA6TTCCCT 



18 



20 



30 (2) INFORMATION FOR SEQ ID NO: 27: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleotide 

/C) STRANDEDNESS: single 
35 ^ 

(D) TOPOLOGY: linear 



# 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

5 

GCCGATATCA CCCGCCATGG 



(2) INFORMATION FOR SEQ ID NO: 28: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GCATCCGGCA ACTGCACG 



20 



20 



18 



(2) INFORMATION FOR SEQ ID NO: 29: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

35 20 
CGCGATGCTG GTTGGAGAGC 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESSs single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: cDNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
15 TCTCCACTCC GAATATTCCG 

(2) INFORMATION FOR SEQ ID NO: 31: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 

(i_i) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
30 GATCTAGGCC ACTTCTCAGG TCCAGS 



2) INFORMATION FOR SEQ ID NO: 32: 



35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 



20 



26 
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(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURES: 

(B) LOCATION: 6, 12, 19 

(D) OTHER INFORMATION: G represents inosine (i) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 32 
CATCTGTTTG GGCAGGCAGT AGC 



23 



15 

(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



25 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTTGAGCCAG TTCTCATACC TGGA 



30 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 



24 



9 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

AGTGYTRCCM CARGGCGCTG AA 

10 (2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

^5 (C) STRANDEDNESSt single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GMGGCCAGCA GSAKGTCATC CA 
(2) INFORMATION FOR SEQ ID NO: 36: 

25 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
3Q (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

35 

GGAT6CCGCC TATAGCCTCT AC 
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22 



22 
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(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
15 AAGCCTATCG CGTGCAGTTG CC 

(2) INFORMATION FOR SEQ ID NO: 38: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
30 TAAAGATCTA GAATTCGGCT ATAGGCGGCA TCCGGCAAGT 

(2) INFORMATION FOR SEQ ID NO: 39 

35 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 50 amino acids 
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(B) TYPE : amino acid 
(ii) MOLECULE TYPE : peptide 
(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 39 



5 



" .eu P.O 0.n Z .sp Se. Z HU .eu PHe 0. O.n ..a .eu 

35 *° 

Ala Oln 

50 

(2) INFORMATION FOR SEQ ID NO: 40 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH . 150 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



20 



25 



(tl) MOLECULE TYPE : cDNA 

SEQUENCE DESCRIPTION : SEQ ID NO: 40 



GACTCTCAAT TCTTGTTTGC CTTTGAAGAT 
ACTGTTTTAC 

30 CCCCATCTAT TTGGCCAGGC ATTAGCCCAA 



60 



...CCCTXTT TCTGCATCCC TGTACGTCCT G„ ^^^^^^^^^ 
CCTTTGAACC CRACGTCTCA ACTCACCTGG ACTGTTTTAC 



(2) INFORMATION FOR SEQ ID NO: 41 

(i) SEQUENCE CHARACTERISTICS : 
•* (A) LENGTH : U a^i"" 
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(B) TYPE ! amino acid 
(ii) MOLECULE TYPE : peptide 

5 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 41 

Cys He Pro Val Arg Pro Asp Ser Gin Phe Leu 
5 10 



10 (2) INFORMATION FOR SEQ ID NO: 42 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 17 amino acids 

(B) TYPE : amino acid 

15 

(ii) MOLECULE TYPE : peptide 
xi) SEQUENCE DESCRIPTION s SEQ ID NO: 42 



Gin Gly Phe Arg Asp Ser Pro His Leu Phe Gly Glu Ala 

15 



5 10 



(xi) 

20 Val Leu Pro 
1 

Leu 
17 



25 

(2) INFORMATION FOR SEQ ID NO: 43 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 8 amino acid 
2Q (B) TYPE : amino acid 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 43 

35 

Leu Phe Ala Phe Glu Asp Pro Leu 



# 
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(B) TYPE s amino acid 
(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 41 
Cys lie pro Val J^rg Pro ^.sp Ser Gin Phe Leu 
- 10 
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10 (2) 



15 



INFORMATION FOR SEQ ID NO: 42 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 17 amino acids 

(B) TYPE : amino acid 

(it) MOLECULE TYPE : peptide 
(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 42 

« i.«o ser Pro His Leu Phe Gly Glu Ala 
20 Val Leu Pro Gin Gly Phe Arg Asp Ser Pr^ 

1 ^ 
Leu 
17 

(2) INFORMATION FOR SEQ ID NO: 43 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 8 amino acid 

(B) TYPE i amino acid 

(ii) MOLECULE TYPE : peptide 
(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 43 
teu Phe Ala Phe Glu Asp Pro Leu 



30 



35 
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1 5 8 

(2) INFORMATION FOR SEQ ID NO: 44 

5 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH : 8 amino acids 

(B) TYPE : amino acid 

10 (ii) MOLECULE TYPE : peptide 

(Xi) SEQUENCE DESCRIPTION s SEQ ID NO: 44 



15 



Phe Ala Phe Glu Asp Pro Leu Asn 
5 8 



# 



(2) INFORMATION FOR SEQ ID NO: 45 

20 (i-) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 25 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

25 

(ii) MOLECULE TYPE : cDNA 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 45 
30 GTGCTGATTG GTGTATTTAC AATCC 

(2) INFORMATION FOR SEQ ID NO: 46 

35 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1859 base pairs 
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25 
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(B) TYPE t nucleic acid 

(C) STRANDEDNESS « single 

(D) TOPOLOGY : linear 

5 (it) MOLECULE TYPE s cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 46 

O^CCTGATTO GTGTATTTAC A.TCCTTTAT CT;^XCCGAA ATGCCCTGT TGCI^TATGG 60 
.0 ^GAAAOGG .CTTCCT..C CTCTGGGGC. ACCCCCATT. ..TACCACA. GTA^ATCATC 

OAGTTATTOC ACACAGTOCA AAAACTCAAG OAGCTGOAAC TCTTACACTO CCAAACCCAT 0 
"cAAAACGG AAGAGGOGAO AAGAGCAGCA TAACTOOCTA CAOACOCAAO GAAACACTAO 
^Zgaa AGAGAGAAAG ACACAOAAAG TCAOAOAGAG AGAGAOOAAG AGACAGAGCA 
CAAAOACGGA OTCAOACAGA CAC.GAC^C. GACACXCAOA OAGAAGGAAA GAGAGAOAGG 60 
15 AAGAGACAAA O^TOAAXCA A.CACAGAOA CAGAAAGTCA GAGAGAGAGA GAGAGAGOAA 420 
Tagacagaga AAAAGAGGGA GTCAGAAAAA GAGAGACCAA AGAAGAAGTC CAAAGAGA. 4S0 



OAAAGAGAGA TGGAAGTAGT AAAGGAAAAA CAGTGTACCC TATTCCTTTA AAAGCCGGGG 
.^TTTAAA ACCTATAATT GATAACTGAA GGTCTTCTCT GTAACCCTGT AACACTCCAA 
TACCACCTTG TTGTCAAGTG TAAACAAGGG CGTAGCCCAA AAGCACTGAG GCCACTAACA 



540 
600 
660 
720 



20 ^.T^^ r.ccn,'^ ^^r^'^ '^'^''^ 
^Jtt« ™cc».»c» .xccxcc^^c c»o»c™c »ao«TTccc T.c.ac»cco 

900 
960 



25 



30 



35 



TCTTAAT. A Aft A * **\*>»'«Tk A « w» • - 

O^T^CT OCTTCCTCCC A=OCC»TT« 0=0««UV« «c;U.TO«T »TT«>CT«> 
„CC«>C<=a= =«=»«=T T»=««TT OC««T«T TO^TTTCCTC 

..a^OTTCTT C«U>CCTTO. «=™CTTC» =«TCM=«* 0-«C.TC 

lcO«.T.C T»CT,»™ «0«:TC«C 0«=tTTT.T T««=»» 1= 

™t«cox. ttc«>cc«. =t».=ttto ct»«o™ »c»occt«c 

r^TC CT.CT^C .«CTCTC« «»»XTTCTC »3«C»=.T„= ..«=»»T» 4 
CCTT«TCT» C««CC^ -»CTCTTT 0=C.O.»=TC .CTCTCC»« .00 
CCT««CCTO CTC.C«CT= ..-^<»-<= 

-™«c. — ™- - 

ri«o=c tt™».tc ^o.c«.c=cc ,ttc«»ttc =t.t.cc«c ctct=c»ott so 

"^TO OT^T^CCC TTTCT.TOTC C««OCTCC CtCTTCCT. TT.CTCCCCT »« 

"cTc ™™ c,™ »™t^ ~- ™" ; 

L»0.0« CO^»C« -VTO»«CCCC «»T=«=CTC «CT«C»C TTCT^CTC.O = 
^^«» CC^OCeCCT C^XTTTC CT0OCCT»« O^OTTCOCCT CTOO.=«.C. 

rr^c Kooccc^c, c^cccc .^«o«cc «o.»-- ~- 
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10 



^„CC C»=.0«0= TOCCC^TCC COm«0-T COOC»^= »=.««-C 7« 
^rCTT CTCOCTCOCO TCO^OACTTC 0»0»C^ OTOTC^CT «»=0»™r .0= 
"ZZc .TC«T««C X««T«^C T»«T.C.CC »TC«»C 



(2) INFORMATION FOR SBQ ID NO: 47 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 23 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



15 (11) MOLECULE TYPE : cDNA 

(xl) SEQUENCE DESCRIPTION : SEQ ID NO: 47 
TGATGTGAAC GGCATACTCA CTG 

20 

(2) INFORMATION FOR SEQ ID NO: 48 

(1) SEQUENCE CHARACTERISTICS : 



25 



(A) LENGTH : 24 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



30 (ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 48 
CCCAGAGGTT AGGAACTCCC TTTC 



23 



24 



35 
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5 



10 



(2) INFORMATION FOR SEQ ID NO: 49 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH J 25 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE ; cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 49 
GCTAAAGGAG ACTTGTGGTT GTCAG 



15 

(2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



25 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CAACATGGGC ATTTCGGATT AG 



30 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 



25 



22 



9 



0 
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(D) TOPOLOGY: linear 
(ii) MOLECULE type: cDNR 
5 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

„ ~ — zzz =: 

,0 o».oo«c„ co™ 

ro::rc = — — — 

„,»0C«OT »TOOTT.TCr «TCCC»«T =COC»«"= 



60 

120 

180 

240 

300 

360 

400 



15 



20 



t2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2389 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



30 



35 



(ii) MOLECULE TYPE: cDNA 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

— " = =: = 

~ — ^C..» 0»OC^ 

,.«T«=OCC T«T=TT.OC C»0^oaC»C<= 

»«CT=«T «~C ^CC.™ ^^^^^^^^ 
„C,TTTTCO T<=.<=™.00. TTCCC«=»™ »=C«=^ ^^^^^^ 
„,C»O0.C »C.C»CO«0 CC^OT^CTC «— » ^^^^ 

C«=CCTTC«> «CCTT««"= 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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— » "° 

CT.TCMMI AATACMOM »MO»TCIC» ^-CTTMAI ICWMOKT MO 

""i^ :^rT ™™=ccc. c««»-c= »o 

= =: oo.c«^ ~ 
— — -™ — o.^.. ..o 

AGTATGCTTA TCTAATCCTA CATGCC ^ggAGTTATT GCACGCAGTG 1320 

-CCCC^. — ^ ,3,0 

'c::::ro ~- o-c^- °' — ^ 
= — — — - = To 

— : =: = — — 

~ — " — «™ -0 

COAGGOTGTA GAGCGCAGAC AGGGAGACCT ^^^^^^c CATACAAAGG 1920 

CCTTAACCCA GCAOCXTTTC TAAAAOOOGA TCXAAATCTT AATTAATTA ^^^^ 
^nr^oA ACTTCCTTCA GGACAGGATG ATAGATGGTT CCT^^^ui* 

^^eco. oea««»T o»«ooo« c«TWr«. 

5 o«r«.=« «T»^o. c.»T^ ^^^^^^ 

,„oo ^^^^^ .eo 

,„T.T=O.C «O^»00T TTr.T ^^^^^ ^^^^^^^ 

'i^r^ r~ ™" 

30 CAACACACTC TCANAGGATT CCTC« ^-.p.cTCTC 2389 

.GGAXAOTAA CTACAATCCC AAATACAXTC TXTGGCAGCA GXGACXCXC 



35 



(2) INFORMAXION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2448 baae pairs 

(B) TYPE: nucleotide 

(C) STRANDBDNESSs single 

(D) TOPOLOGX: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 53: 



...G CCCCCATCTA TTTGATCAGG CACTAGCCCA AGATCTAGOC CACTTCTGAA 60 
' :ZZZ C— CAGTATGTGG ATGATTTACT TTTGGCTACC AGTTTGGAAG 130 
CO crcCCA GCAGGCTACT TGAGATCTCT TGAACTTTCT AGCTAATCAA GGGTGTATGO 
CAXrAAATT GAAAGTCCAG CTCTGCCTAC AACAAOTCAA ATATCXAGCC CTAATCTTAO 3 0 
atIT^c CAGGOCCCTC AGCAAGGAAT GAATAAAGCC TATGC^OCT TATCGGCACC 0 
. ^AircrT AAAACAATTG TGGGGGTTCC TTGOAAXCAC TGGCTTTTGC CGACTATGGA 360 
' ™ — TA GCCAGGCCCC CTCTATTACT CTTATCAAGG AGACCC^OAG . 
GGrAAATACT TATCTAGTAT TATGGGNACC AGAGGCAGAA AAAGCCTTCC AAACCTTAAA 0 
ZrcCXA GTACAAGCTC CAGCTTTAAG CCTTCCCACA GGACAAANCT TCTCTTTATA 
™G AGAGCAGGAA TAGCTCCTGG AGTCCTTACT CA™ GAC.CCCCA 

nZm. »™~" ■■™-" z 
'■ ========= 

==5====""- 

35 AAAAGAGAGA ACGAAAGAGA CAGAAT armCAGTGT ACCCTATTCC 1620 

AGTTAAGAAA GTGAGAAAGA GAGATGGAAA TAGTAAAGAA AAAACAG 



PCT/IB97/01482 



WO 98/23755 



150 



10 



^ TAAAACGTM AATTTTATWV TTGGWVGGTC TTCTCCATAA 1680 
TTTAAAAGCC AGGGTAAATT '^^^^^^ ^,^,;^c AAGAGCATAG CCCAAAAGCA 1740 
CCCTATAACA TTAAAATACC ACCTTGTTGT CAGTOT^C ^^^^^^^^^^ ^^^^ 

COAGGCCAC .GACAACCCA .AGCC..- ^C-^ ^^^^^^^^ 
^CAGOGOA. C.AAATC.CA ACX^^- ^^^^^^^ ,„0 

CCCCCCAG GACAGAAGCA .^.^0- ^^^^^^^^ ^^^^^^ ,,3, 

^OXATTCA GTAAGXO..A AGGGAA- ^^^^^^ ^^^^^^^^^ ,,,, 

T^CGTCTGC TCAAATOTGC ^^^^^''^^ ^.oCTGAACAA GTTCTTATTA 2100 

,...COAAOG AGCCATCTAT ACCAATTCTa AGT^TA ^^^^^^^ ^^^^ 

...OCAAAGA ATCATTGAAA TCTCAAACTT GC^CTT^T ^ ^^^^^^^^^^ ^^^^ 

— — ^A " --^^^^'^ 

TCAGTGCCCC TCAAAGCTGA '^^^'^^^^^^ ^^,,oAGTAA CAGGTTTATC TACTTCATTA 2340 

.,.«..XAG GAAXGGCCAC — ,,00 

TCCTATTACC ACACACTCTT AAAGGATTTC -..cTCTC ^448 

.TCCTTACTC TNTARTCCCA AATAGHTTCX TTGCCACCAG TGACTCTC 



15 



(2) INFORMATION FOR SEQ ID NO: 54s 



20 



(i) SEQUENCE CHARACTERISTICS. 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 (ii) MOLECULE TYPES cDNA 

(,1) SEQUENCE description: SEQ ID NO: 54: 

CCTGA6TTCT TGCACTAACC C 

30 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleotide 



21 
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(C) STRANDEDNESS « single 

(D) TOPOLOGY: linear 

(ti) MOLECDLB TYPE: cDNA 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GTCC6TTGGG TTTCCTTACT CCT 

10 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1196 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



23 



15 



(ii) MOLECULE TYPE: cDNA 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

TTCCTGACTT CTTGCACTAA CCTCAAATOA GAGAAOTOCC GCCATAACTG CAACCCAAGA 60 
: TroCGAT CCCTOGTATC TCAGTCAGGT CAATCACAGG ATGACAACAG AGGAAAG^XA 
25 ATGATTCCCC ACAGGCCAGC AGGCAGTTCC CAOTGTAOAC CCTCATTAGG ACACAGAATC 
CATTGGTGCC GCAGACATTT CCTAACTTGC G^CTAGAAO GACTAAGGAA 
rTrOGAAG ATATGAATTA TTCAATGATG TCCACTATAA CACAGOGOAA AOGAAGAAAA 
ZlorOOC TTTCTGGACA GACTAAGGGA COCATTOAGG AAGCATACCA GGCAAGTGOA 
CATTGGAGGC TCTGGAAAAC GGAAAAGTTO GCAAAAGTAT ATGTCTAATA GGGCTTGCTT «0 
30 C™. CTACAAGGAC ACTTTAAAAA AOATTGTCCA ATAGAAATAA GCCACCACCT 0 
rcCAXCCC CCTTATGTCA AGGGAATCAC TOGAAGGCCC ACTGCCCCAG GGGATGAAGG 5.0 
Tec CXOAGT CAGAAGCCAC TAACCAGATG ATCCAGCAGC AGGACTGAGG GTGCCCOGGG 
roCGCCAO CCCATGCCAT CACCCTCACA GAGCCCCAGG TATGCTTGAC CATTGAGGGT 
crGAloCGTA CTGTCTCCTG OACACTOGCO OOCCTTCTCA OTCTTACTTT CCTGXCCTGG 
35 TCCAOATCTG TCACTOTCCG AGGGOTCCTA GGACAGCCAG TCACTAGATA 

CrXCCCAO CCACTAAGTT OTGACTGGGG AACTTTACTC TTCCACATGC TTTTCTAATT S.O 



600 
660 
720 



# 
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^nn»r»rATT CTR6CAAAAG CAGGGGCCAT 900 

,^eco«. occc^cTC c^c^ — ^ „„ 
„..c.TO« ^« ,ec=,cc«T .0,0 

.„TCCT«« oTccooc^ « ^^^^^^^ ,™ec «80 



10 



15 



(2) INFOBMATION FOR SEQ ID NOs 57s 

(i) SEQUENCE characteristics: 

(A) LENGTH: 2391 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE type: cDNA 

,xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

r^r AGGGTGCCCG GGGCAAGCGC CAGCCCATGC CATCACCCTC 
.XOATCCAOC AGCAGGACNG AGGGTCCC^^ ^^^^^^^^ ^^^^^^^^^ 

.CAGAGCCCC AGGTATGCTT G^.- ^^^^^^ ^^^^^^^^ ^^^^^^^^^^ 

CGCGGKGCCT TCTCAGTCTT ACTTTC T ^^^^^^^ ^^^^^^^^^^ ^^^^^^^^^ 

.XCCGAOOOG TCCTAGG.C. GCC^G^ ^^^^^^^ ^^^^^^^^ ^^^^^^^ 

, TGGCO-CTT TACTCTTCCC ACATGCT ^^^^^^^ ^^^^^^^^^ ^^^^^^ 

TTGGGGAGAG ACATTCTAGC '^'^^^^ ^^,,;^.c CTGAAGTCCG GGCAACAGAA 
.CAACTGTTT GTTG^CC^ GC_ ~ ^ 

CGACAATATG GACAAGCAAA .CTCCAAAAG 
„CCT ACC^. ---- ^^^^^ .„T 
,0 ATTGTAAAGG ACCTAAA^CC CCAA ^^^^^^^^ ^^^^^^^^^ 

CCAATTTTAG GAGTAAGGAA ^'^^^'^^^ ,^«t;,cCTA ACCCTTATAC AGTGCTTTCC 

— - — - -^t^Ta ™CC TTAAGGATGC CTTTTTCTGC 
e^.ACCAG AGGAAGCAGA GTGGTrr- ^^^^^^^ ^^^^^ ^^^^^ 

MCCCTGTAC GTCCTGACTC TCAATT ^.XAGCCCCCA TCTATTTGGC 

CAGGCATTAO CCCAAGACTT GAGTCAAI 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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.^n^i. CAAACCTTGT GCCATCAAGC CACCCAA6AA 1020 
^TGGATGATX TACTTTTAGT CGCCCGTTCA ^'^^^^^ ^eCAAAGGC TCGGCTCTGC X080 
CXCT..ACTX CTG.GGCX.C .AGGXT^^ K^CC- ^^^^^^^^ ^^^^ 

.CKC.GO.G. ™.T.CXK .GGGCX.... .T.XCC^ 0-0^ ^^^^^^^^ ^^^^ 

O^CGXAXCC GGCXX.XCCX -^^^^ ^„ 

..CCTTGOCA TAACAGGXrr CTOCCGI^ CAGATTCCC ^^^^^^^ ^^^^ 

CCATT.T.T. GG...CXC.G ™- ^^^^^^^ ^^^^ 

CCX.C.O..O XGGCXXTCC. GGCCCX...G ^^^^^^ 
^OCCAACG GGC..G.XTX GCC.C.GA^ ^^^^^^^ ^^^^ 

.GOXCXCGG G^.^^ ,3. 
C.TGTAGTGG CMAGGGTTG GCCTCATNGT TTATG ^^^^^^ ^^^^^^^^^ ^^^^ 

«...CXG..G C.GXX....T ..X.C.G- .0^-^^^^^^ ^^^^^^^ ^^^^^^^^ ^,3, 
OTGAACGGCA XACXCACXGC TAAAGGAGAC ^^^^^JJ ...cXXGXGC AACXCTXAAA 1740 

— — — "c rAGCCAA C_ 1300 
CCCAAACXXA XGCXGCCC.G AAGG.XCX ^^^^^^^ ^^^^^^^^^ ^3,, 

WVCXAXAXAX AXACXGAXGG AAGXXCGXXX ^^^^^^^cCC CCAGGGACCA 1920 

«CCAXACGXC TXACXGAXAA AGCAGXACXX GAAAGT^OC CXCXT ^^^^^^ ^^^^ 

„OX XAOCAGAACX AGXGG^ACXG ~- ^ _ _ 

CXXXAXCTAA XCCGAAAXGC C^XGXX ^^^^^^^^^^ ^^^^^^ ,,,, 

OGGGGAACCC CCAXXAAATA CCACAAOXTA AXCA ^^^^^^^ ^^^^ 

CXCAAGGAGG XGGAAGXCXX ACACTGCCAA ^^^^^^^^ ^.^^cAGA GAAAGAGACA 2280 
„GX GOCXACAGAG GCAAGG^G ACXAG^^^ ^O^^^^^^ ^^^^ 

=rx=A:= — ACAAAGAAXGA ^.l 



10 



15 



20 



25 



30 



(2) INFOKMAXION FOR SEQ ID HO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1722 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 (ii) MOLECULE TYPE: 



CDNA 



# 
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(Xl) SEQUENCE DESCRIPTION: SEQ ID NO; 58 i 

TGGAC^TAG CAGCATi^GT TGGCTGGCAG AlVGTAGGGA. AGACAGCAAG AAGTA^GAA SO 
^GiVGA ^GTCAO;«.A ^O.^^ GAGAGGAAG. AACAAAGAAG AACTTGAAGA 20 
5 o;,OAAAGAAG TAGTAAAGAA AAAACAGTAT ACCCTATTCC TXTAAAAGCC AGGGTAAATT 1«0 

TCTGTCTACC TAGCCAAGGC ATATTCTTCT TATGTGGAAC ATCAACCTAT ATCTGCCTCC 
CCACTAACTG GACAGGCACC TGAACCTTAG TCTTTCTAAG TCCCAACATT AACATTGCCC 
CAGGAAATCA GACCCTATTG GTACCTGTCA AAGCTAAAGT CCCGTCAGTG CAGAGCCATA 
CAACTAATAT CCCTATTTAT AGGGTTAGGA ATGGCTACTG CTACAGGAAC TGGAATAGCC 420 
10 GGTTTATCTA CXTCATTATC CTACTACCAT ACACTCTCAA AGAATTXCTC AGACAGTTTG 480 
CAAGAAATAA TGAAATCTAT TCTTACTTTA CAATCCCAAT TAGACTCTTT GGCAGCAATG 
ACTCTCCAAA ACCGCCGAGG CCCACACCTC CTCACTGCTG AGAAAGGAGG ACTCTGCACC 
TTCTTAGCGG AAGAGTGTTG TTTTTACACT AACCAGTCAG GGATAGTACG AGATGCCACC 
TGGCATTTAC AGGAAAGGGC TTCTGATATC AOACAATGCC TTTCAAACTC TTATACCAAC 
15 CTCTGGAOTT GGGCAACATG GCTTCTTCCA TTTCTAGGTC CCATGGCAGC CATCTTGCTG 

TTACTCACCT TTGGGCCCTG TATTTTTAAG CTTCTTGTCA AATTTGTTTC CTCTAGGATC 840 
CAAGCCATCA AGCTACAGAT GGTCTTACAA ATGGAACCCC AAA^AGTTC AACTAACAAC 900 
XTCTACCAAG GACCCC.GGA ACGATCCACT GGCACTTCCA CTAGCCTAGA GATTCCCCTC 9 0 
TGGAAGACAC TACAACTGCA GGGCCCCTTC TTTGCCCCTA TCCAGCAGGA AGTAGCTAGA 020 
20 GCGGTCATCG GCCAAATTCC CAACAGCAGT TGGGGTGTCC TGTTTAGAGG GGGGATTOAA OSO 
OAGGTGACAG CCTGCTGGCA GCCTCACAGC CCTCGTXGCV XCTCAOTOCC TCCTCAGCCX 1 40 
TGGTGCCCAC TCTGGCC^TG CTTGAGGAGC CCTTCAGCCX GCCACTGCAC XGXGGGAGCC X200 
.CXXTCTGGG CXGGACAAGO CCGGAGCCAG CXCCCXCAGC XXGCAGGGAG GXATGGAGGG 1260 
.CAGAXGCAG GCGGGAACCA GGGCXGCGCA XGGCGCXXGC GGGCCAGCAX GAGXTCCAGG 1320 
25 XGGGCGXGGG CXCGGCGGGC CCCACACXCG GGCAGXGAGG GGCXXAGCAC CXGGGCCAGA 380 
CAGAXGCrrGX GCTCAACXXC XXCGCXGGGC CXXAGCXGCC XXCCCCGTGG GGCAGGGCXV 1440 
CGGGAACHXG CAGCCXGCCC AXGCXXGAGC CCCCCACCCC GCCGXOGOXX CVXGCACAGC ISOO 
CCAAGCTTCC CGGACAAGCA CCACCCCXTA XCCACGGXGC CCAGXCCCAT CAACCACCCA SO 

Igxxoagg agxgcgggca cacagcgcgg gaxxggcagg cagxxccacx xgcggccxxg 

30 GXGCGGOAXC CACTGCGXGA AGCCAGCTGG GCXCCXGAOX CXGGXGGGGA CXTGGAGAAT 1S80 
CXITATGTCT AGCXAAGG6A IXGIAAAXAC ACCAAXCAGC AC 



35 



(2) INPORMAXION FOR SEQ ID NO: 59: 
(1) SEQUENCE CHARACIERISTICS: 



§ 
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10 



15 



20 



25 



30 



(A) LENGTHS 495 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS s single 

(D) TOPOLOGY « Linear 

(11) MOLBCOLB TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

CTTCCCCAAC TAATAAGGftC CCCCCTTTCA ACCCAAACAG TCCAAAAGGA CATAGACAAA 60 
CGAGXAAACA ATGAACCAAA GAGTGCCAAT ATTCCCTGGT TATGCACCCT CCAAGCGGTG 120 
GGAGAAGAAT TCGGCCCAGC CAOAGTGCAT OTACCTTTTT CTCTCTCACA CTTGAAGCAA 80 
ATTAAAATAG ACNTAGGTNA ATTNTCAGAT AGCCCTGATG GYTATATTGA TGTTTTACAA 240 
CGATTAGGAC AATCCTTTGA TCTGACAXGG AGAOATATAA TATTACTGCT AAATCAOACO 300 
CTAACCTCAA ATGAGAGAAG TGCTGCCATA ACTGGAGCCC GAGAGTTTGG CAATCTCTGG 360 
TATCTCAGTC AGGTCAATGA TAGGATGACA ACGGAGGAAA GAGAACGATT CCCCACAGGG 420 
CAGCACGCAG TTCCCAGTGT AGCTCCTCAT TGGGACACAG AATCAGAACA TGGAGATTGG 480 
TGCCGCAGAC ATTTA 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2503 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



35 



CC«C«>C=C .CC^TTCCC C«.««T,X ,»CO.C«C C»OCC^ 

cc«acoc« «»c«>T«=c «=«c«=«» O.CCTTTCCC »T«=c«c c.TcxaT.c 0 

r»a.CCC CTTTC^CTTC CT«TCT.TC CX«=TT,CT TT.C^TTCO =O0OCT»«T 

™tc ^.c»oo»t r,»^r^ ™cc oco=.cx« 



# 
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CT««»« =CTCTCT«C ^CCCCCMC TC«,«»T «OO.CC=^ 300 

====== 

n.rTGGMLar CTCAAAAGCA TGTCGCCCAA GCGACACTCG CCTATi,i«. 

„ ^n-rrnv-rCTC CTAATGCCTG CCAGACAAAC TTCCTCTCGC CTCTCTTCit- 

^ =:=_c^^~r= ::: 

CTATAAAGAA TGAWTTCTAG TATTAAACTC CAGGACloi ^00 
CTATAAA^nA „^^„*r^rcc CATCGTAGTG GGGACTACCT 720 

m^»r**iiiinar ACAGTTTTTG CCCAAGGCCC V^ai^via""* 

GGAATTTTAG ^^^^.tCCCTC CTATTCATAT AAGTGAGAAC AAAAGGTGTC 840 

" ™: ".c cc_ „ca ™. - 

T^Z ATCTTTATAG GATGGGGTAA AOTCCCAATA CTAACAGCAG AATGCTTAGG 960 
ZZ AXGCCCAGX AAGGOCCACX AAAXCXGAT. TTTCXCAG.C XO.O 

ACTCTAACAG GTTTTTGAGA .^^y^^OOK GGTTTTCGAG AATGCGTCAG 1080 

PGTCCTCCTT GTGGTCTAGG AGGACAGGCA AGGTTGTGCA ^ 

....TCCGAC CTTCCTCGGT CCTCCATGTG GTCTGGGAGG AAAACTAGTG 1140 
15 TAAGGACCAC TAAATCCGAC „eAAGTCAG CAGGGTCCAG GGACCGTTGC 1200 

TTTCTGCTGC TGCGTCGGTG AGCGCAACTA TTCAABU^« 

^.CTTCO 0C«===™ «COCT=C T=««C^ » 
«»C«TT=C .OCTCCTTOO C»<»0.C>O ^C^^^C. 3.. 

T™. T^rc^ ~" 

==-iEi===':ro 
= = =: = = = :::: 

TCAAGVTGTA GGGGGAGGGG ^"^^^ ^,^,^,eAG ATGATCCTGA TAGGTACATA 1920 

— :::::::: ™ ogagagacg. ca™ .so 
rGGcc. _ AA— — — :::: 

ogagatacct ggtatcctag tcaagtaaax gaaagaatc. c^gccg^ag^ ^^^^ 

„CCXXACXG g™c CAXCCCCAGX A.^^ — ^^^^ 
C^VTGGGGACT GGAGTCGTAA ACATCTGTTG ATCTGTGTTC ^^^^ 
gggaaaaagc — xa xxc^xg^^ x™ c^caggg^^^ ^^^^^^^^ ^^^^ 
CCTTCTGCCT TCCTCGAGCG GCTACAAGAG ^^^^J^ ^^cCCAATC AGCCACAGAT 2400 
GAATCACTCG AGGGTCAATT GATTCTAAAA GATAAGTTTA TTACCCAA 



25 
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«CAAGCCCTO AOCCTOAACA AWVTCTMAG AOVTTATTAA 2460 
ATCRGGAGWV AGCTCCAAAA GCAA6CC0T0 RO ^503 
icCXGOO^C CTTOGTOTTC T.T.AX.GGG .CC..G.GO. AC 



10 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1167 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

^^ccc .^^^ ccx^coc^ o«^« 2^.^ «o 

O.CTTTTCTT T«I»TOTC»C «.»««CM GM«<=C „^CCMA== 240 

, ^.c«c c«»c«c=. ™cc ~ -™ 

= ~. c.™^ —r: = To 

^^-.^o/^nr- TGCACATTTG TGCRACTCTT AACCCAGCCA CAIli^i^ 
TGAAGTGCCA GTGCTGCGAC TGCACATTTl, ^^r^^r^.. CCTATGCTGC 540 

„™CTT<=» «.»tCO=C T»=TC»=0 «c™«=T CKCCT 
JO OTATGCTTAC CTAGTCCTCC AlO caqatTATTA TTGGCTGTAC 900 

^™«c= c«.coc»« ^^^^^ ^^^^^ 

CmMJlMT OCTTMMM OOftCCCOTM .™c»TACT TTCCTCCCCT 1140 

55 CCCMTOCIC AGtaOGMM OTMMWGG WUCCIIJ. 

1167 

CCAGATGGCT agccactgag gaaggaa 
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(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(Xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 62: 
,S XCC»«OC. C.C^^^ ™CC C.T.C^CC .T.TCCCT « 

CCCAAAACCC TARAGCRA 



(2) INFORMATION FOR SEQ ID NO: 63 

20 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 26 amino acids 

(B) TXPE : amino acid 

25 (ii) MOLECULE TXPE : peptide 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 63 

30 - ^ 



Ala Tyr Pro His Pro Lys Thr Leu Lys Gin 



25 
20 



35 (2) INFORMATION FOR SEQ ID NO: 64: 



# 
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(i) SEQUENCE characteristics: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
g (D) TOPOLOGY: linear 



(i 



ii) MOLECULE TYPE: cDNA 



15 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ii ) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
AAGGGGCATG GACGAGGTGG TGGCTTATTT 

(2) INFORMATION FOR SEQ ID NO: 66: 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
/D) TOPOLOGY: linear 

35 



PCT/IB97/01482 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

10 28 
AAATGTCTGC GGCACCAATC TCCATGTT 



30 
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(ii) MOLECULE TYPE: CDNA 

(Xi.) SEQUENCE description: SEQ ID NO: 66: 
GGAGAAGAGC AGCATAAGTG G 

5 

(2) INFORMATION FOR SEQ ID NO: 67: 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 (ii) MOLECULE TYPE: cDNA 

(^i) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
GTGCTGATTG GTGTATTTAC AATCC 



20 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleotide 
25 ^ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 
GACTCGCTGC AGATCGATTT TTTTTTTTTT TTTT 



J5 (2) INFORMATION FOR SEQ ID NO: 69: 
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10 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPES nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
GCCATCAAGC CACCCAA6AA CTCTTAACTT 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
25 CCAATAGCCA GACCATTATA TACACTAATT 



30 



30 



(2) INFORMATION FOR SEQ ID NO: 71: 

JO (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: cDNA 



9 
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J,,) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
GCCATARCT6 CAACCCAAGA GTT 



5 

(2) INTOKMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



15 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GGACGAGGTG GTGGCTTATT TCT 



20 

(2) INFORMATION FOR SEQ ID NO: 73: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



30 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
AACTTGCOTG CTAGAAGGAC TAAGG 



35 

(2) INFORMATION FOR SEQ ID NO: 74: 



# 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 24 base pairs 

(B) TYPES nucleotide 

g (C) STRANDEDNESS : single 

(D) TOPOLOGY! linear 

(ii) MOLECULE TYPE: cDNA 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

AACTTTTCCC TTTTCCAGAT CCTC 

15 (2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



25 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
GCATACCAGG CAAGTGGACA TT 

30 (2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

iC) STRANDEDNESS: single 
3d * ' 

(D) TOPOLOGY: linear 



PCTAIB97/01482 



24 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 76: 



( 

10 



2) INFORMATION FOR SEQ ID NO: 77: 



PCT/IB97/01482 



5 25 
CTGTCCGTTG GGTTTCCTTA CTCCT 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

20 24 
GAGGCTCTGG AAAAGGGAAA AGTT 



(2) INFORMATION FOR SEQ ID NO: 78: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

35 25 
CTGTCCGTTG GGTTTCCTTA CTCCT 
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(2) INFORMATION FOR SBQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
15 AGGAGTAAGG AAACCCAACG GACAG 

(2) INFORMATION FOR SEQ ID NO: 80: 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 (ii-) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
TGTATATAAT GGTCTGGCTA TTGGG 

30 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 81: 
AGGAGTAAGG AAACCCAACG GACAG 



10 

(2) INFORMATION FOR SEQ ID NO: 82: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 



20 



(ii) MOLECULE TYPE: CDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
TTCGGCAGAA ACCTGTTATG CCAAGG 



25 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



35 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 



wo 98/23755 
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CTCGATTTCT TGCTGGGCCT TA 
5 (2) INFORMATION FOR SEQ ID NO: 84: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



22 



10 



15 



(ii) MOLECULE TYPE: cDNA 

(^i) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



20 

GTTGATTCCC TCCTCAAGCA 



20 (2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



25 



30 



20 

CTCTACCAAT CAGCATGTGG 
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35 (2) INFORMATION FOR SEQ ID 



wo 98/23755 



# 

PCT/IB97/01482 



10 



168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 



19 

TGTTCCTCTT GGTCCCTAT 



(2) INFORMATION FOR SEQ ID NO: 87: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 aminoacids 

(B) TYPE: aminoacid 

20 (ii) MOLECULE TYPE: peptide 

Met Ala Thr Ala Thr Gly Thr Gly He Ala Gly 

5 ^° 
^ . T>ho qar A8t> Ser Leu Gin Glu 

25 ser Tyr Ty. His Thr Leu Ser Lys Asn Phe Ser A.p 

20 " 

c Tie Leu Thr Leu Gin Ser Gin Leu Asp Ser Leu Ala 
He Met Lye Ser He Leu Tnr 

40 

1 - «' »" 7 

" ... :l o« cv, ... 1 ^ <=.v 0.. 'C. c. ... - 

70 ^ 

1 s.. X.. v.. ... - - 

Tvr Thr Asn Leu Trp 
35 Ala ser Asp He Arg Gin Cys Leu Ser Asn Ser Tyr 

100 
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ser Trp Ala Thr Trp Leu Leu Pro 
115 



169 

Phe Leu Gly Pro Met Ala Ala He 



120 "5 



130 "5 

155 

1AR 150 

Z p~ - - 'Z 

170 

165 



^ S« ... S„ T« OXu X.. - « 

180 

195 200 205 

„. v.. u. u. P.O s„ s.. «p o., 

P.. r, o., n. V. ... cv. u ... -u - 

230 

..P X.. ... v.. O.. C= «P P" 

C s.. Z ... «. cv. »- v.. O.V ». - P- 

" ... s.. P.O - "V - - 

280 285 

o,, «P "V -V - -V - 

::: s.. r: «. - - - - - - 

310 

::: ... ™. «. - - - - - 

325 

se. s„ - - - - - 

" ... "°. - s.. p.o p.o p.. 

355 360 

, H«« HtB pro Leu ser Thr Val Pro 
HiB ser pro Ser Phe Pro Asp Lys H.s H.b 

Asn His pro Irg Val Glu Glu cys Gly Hie Thr Ala Arg 
35 ser Pro He Aan Hxs pro « y 

390 ^ 

385 
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„^ c.. »a v.. «o ... ». "° 
405 "° 

«. s„ '« "° 



425 

420 



Val 
433 



10 



15 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 



20 



25 



30 



CXTCCCC»C T«T»OO.C CCCCCT.TC. ^C.O^C.O .OC^^ C.T.O»C«» 0 

^■^c. — ° 

XC0OCCC.CC C.O»OXO»T CT^™ CTCTCXCC. CXTO«CC« «0 
.C^»COX»» .TX,^«T «.OCCTa»TC CVX...T.O. «T«T.C« 4 
:l".C »TCCT™. TCTO.C.TCC «»»»T« T»TT.CXCCT »»TC.O.CC 0 

TCCTOCOT. .CTOC^^C C.COTXTOC C«TCTCT 
^^.OXC »««»™ T»00»O.C» »CCC«CO». O.C«CO«XX CCCC»C»CCO 4.0 
^LoOCAG TTCOC«=TOT AOCTCCTC.T TOOO.C.C»C «.TC.a«>C« TCO»»TT0O 4=0 
^^^C l^.CT^»C TTOCOTCCX. OM=O.CT« <.=«MCT« 0«O»CT»T0 S40 

:™ ~c x™ ooo«.oo» cxocc^' - 

:™ .OOO.OOC.X — c. x.cc^c« oxco^cx^ 0-cxcxoc s» 

AAAAGGGAAA AGTTGGGCAft ATTGAATQCC TAA 



35 (2) INFORMATION FOR SEQ ID NO: 



89: 



• 
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(i) SEQUENCE CHARACTERISTICS: 

(A) length: 1577 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



^0 ^y.r.m.nn^r'V ATGAATTATT CAATGATGTC 60 

^c^ccxc c^^^ T^oo'-c 

o«.^- c~ ^^^^^ ^^^^ 

™ L~ »-«o.c=c »o 

„««^o ™o«»..= oc™ ^^^^^ 

OMOGCOTM TOCCCCGOG G»OGWGGTG '^'^ „„cCATC» CCCTOMMC 4J0 

G„X GC™ .^-^ ™ ^ 

COTTCTCMT CTOClMCO ■TCTCCOl.GAC WlTTGT oAOTCGGGM 600 

,„ CGGTCCT^G .CCT^ TOTCTC^G CX«=.^T 

— :™ :rr: — gc„ 

-C^" ,„„O.C»T 7S0 

MGG»CiAGC MJGMTGOC CGtCCTGITG ^o^ciCHM MMTGTTM 900 

rG^: = — := :::: 

„CTG^« C=tC»«=C C.GCXOX.TC ™" ™ ^^^^ 

— rrGC» .^C„^ 
,0 AGAICCTOM TCTC«TTCI TGTTTGTCTT ' ^^^^G GCCAGGCATT 1260 

,™„C C.™ G^«J 

Z ™^ -CC.GGGC CCTCTGTGXG G^^.TGC UOO 

35 TTAAATACTT AGGGTTAAAA TT« .cTAAGWVGG TCCTTGGCAT 1560 

^CCTGTACT GGCTTATCTT C.TCCCAm CCCTAAAGCA ACTAAGAAGG 



# 
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1577 

AACAGGTTTC TGCCGAA 



(2) INFORMATION FOB SEQ ID NOs 90: 

5 

(i) SEQUENCK characteristics: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 



10 



35 



(ii) MOLECULE TYPE: peptide 

(^i) SEQUENCE description: SEQ ID NO: 90: 



JB Gin Pro Met Pro 



S„ s.r se. «, T-r olu Oly «■ '^^^J W 

" L «o - L« v.. cv. ... 1 n. - 

25 

20 

35 *° 

P„ OU s.. s.. «, se, ... X. v.. 

:L v.. - - ..P 

70 ''^ 

. Ti= Met Pro Glu ser Pro Thr 
Thr Leu Leu Phe Ser His Ala Phe Leu He Met 

85 '° 
25 , «ia TVS Ala Gly Ala He He His 

P.O Leu Leu Gly Arg Asp He Leu Ala Lys 

105 

n. r, oxy xx. P« XX. c c «o X.. ox. «x. 
30 XX. ..n V.X .X. XX. =X. OX, OXn CX, CX„ «. 

::: ::: ».x - - - - - 

150 

^"^^ . Pro Glu Ala Leu Gin Gly Leu 

tiv.rt TVS Tvr Pro Leu Arg Pro ^j-u 
Pro Tyr Gin Arg Lys lyr ri. 

170 "^^ 

165 



Lys Arg Leu Leu Arg Thr 



t 
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180 



(2) INFORMATION FOR SEQ ID NOs 91: 

5 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE; nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



10 



ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
AGATCTGCAG AATTCGATAT CACCCCCCCC CCCCCC 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 92: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

30 22 
AGATCTGCAG AATTCGATAT CA 

(2) INFORMATION FOR SEQ ID NO: 93: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2304 base pairs 

(B) TYPE: nucleotide 



t 
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(C) sTW^DEDNESSt single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 



TCCGCGCA aO.CTG.GGG TGCCCCGGOC AAGTGCC.OC CC.TGCC.TC 

CCCCOGGT.T GTTTO.CC.T TC.O.OCC.O 
o""cTGG. CCTGOCGC. GCCTTCTC.0 TCTX.CTTTC CTGTCCCG. 
l^TCCT CC.O.TCTGX C.CT.TCCO. GGOGTCCT.G G.CGCCGT 
TcT^CT^C TTCXCTC.GC C.CT..GTTG TG.CTOGGG. .CTTT.CTCT 
rJcCXGC TTTTCT..TT .TGCCTGM. OCCCC.CTCC CTTGTT.GGO 
rlcTTT T.GC....GC .OGGGCC.TT .T.C.CCTG. .C.T.OO... 
^Z.CCC .TTTGCTGTC CCCTGCTTG. GG..GG..TT ^TCCTO..O 
Tt^CKT .G..GG.C.. T.TGO.C^O C^O^TCC CCOTCCXGTT 

^rr^c t;^gg.ttc tocctccttt ccct.cc... og..ot.ccc 



:r^«C= CC™..«« C~» ™« ^CTCC^T. «0 



AAGCCCAAGG cuxH^ix*™^ 

„»C=K»™ <=0«^COC«, C=G.C=T«= «><.»TCTC«= 

L™tt«t =«octc«t ttcotctw^ =cc«.ct=t« TCT^CCOCTT 

rcXCTCCT T,CCCT»« OC^^ ''"'^^^ 
LcC««CO .TCCCT^T. CTC»TCC=T .T«:OT=CT<; .CTCTC^TT 
C^™=CC TTTO»«.TC =TTT=«CCC «COTCTC^ =T=.C=TO<=» 
C„^.CC CC^OCTTC ^«=<=.™CC OCmC«TT ,OOC«=C« 

rrMCCc»c »ctto»o,» attctc.™ cto«»c»ctc ttotccttc 

.„..^.T ..T^.CT,. ,TC.=«.CC T,=T=C^.. 

„CCC.CCC. .C«CTCTT. »CT«CCTC. CT»=CT=T=0 ™C«=TT 
rlcC^ .OOCTCOOCT CTCCT»C.a O^.TT.O.T .«T»=CC 

in^^TCc ««ac«c. «oc==T«o TO^»C=T ..cc»=cc.^ 

^CT,» ^-CC ««CCCT» »CC«CT».C .CCCXTCC- 
™C»0 CTTTCT^CCO ««C««TT CCC«=CT.C. CCCC«««C 

, ™™ «T.C»CT« ^ooc^c .c.a«.=cc «™ 
r«T»«.TO c.»cc»« o«,=.aoc^ TCCCOCCCT »««»cc 

rrCC»C C».=^=CC. .COCOC^ .™TCr^ 

""=C<=.« =«««A«0 0«,T«=CT« «O.OTCCTT ^C0C»=OT« 

rrr-o cttcc^ccc =t=.™»cc — » ~« 

, OTTCCCC^. TTC^«- OT«,C,=COO C.=T«C.<= 

0««»CT.» 000«0».« =.T.-X<=T 
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GGACATCTCA 
TTGTCAGACA 
AGTGCTGAGA 
CAGACAATGA 
5 ACCTATGCTG 
CCTCAACTTG 
AAGCGGGGTA 
CTCACTCCAG 
GGCACTAGAA 
10 AGTATGCTTA 
GAATTCCTAA 
GAGATTATTA 
GCCAGGGTCA 
GCGGATATTG 
15 GCTT 



TGATGTGAAC 
ACCATTTACT 
CTGCGCACTT 
AGAAAAGATA 
CTCGAGGGGA 
TATACTGATG 
TGCAGTGATC 
GAACTAGTGC 
TTAGGAGAAG 
CCTAGTCCTC 
CTTCTGAGGG 
TTGGCTGTAC 
TCAGGAAGAA 
AAGCAAAAAA 



GGCATACTCA 
TAATTATCAG 
GTGCAACTCT 
GAACATAACT 
CCTTCTAGAG 
GAAGTTCCTT 
AGTGATAATG 
TCACCTGGCA 
GAAAAAGGGT 
CATGCCCATG 
AACACCTATC 
AGAAACCTAA 
GAGGAAAGGG 
AGCCGCAAGG 



CTGCTAAAGG 
GCTCTATTAC 
TAAACCCGCC 
GTCAACAAGT 
GTTCCCTTGA 
GGCAGAAAAA 
GAATACTTGA 
GAACTAATAG 
AAATATATAT 
CAGCAATATG 
AACCATCAGG 
AGAGGTGGCA 
AAATAGAAGG 
CAGGACTCTC 



AGACTTGTGG 
TTGAAGAGCC 
ACATTTCTTC 
AATTGCTCAA 
CTGATCCCGA 
GGACTTCGAA 
AAGTAATCGC 
CCCTCACTTG 
TCAGACTCTA 
GAGAGAGAGG 
AAGCCATTAG 
GTCTTACACT 
CAATCGCCAA 
CATTAGAAAT 



20 



(2) INFORMATION FOR SEQ ID NO: 94: 
^i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2364 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE description: SEQ ID NO: 94: 



25 



30 



35 



ATGATCCAGC 
CATCACCCTC 
GTNACTGTCT 
CCTGGACAAC 
GCCAGTCACT 
TACTCTTCCC 
TTGGGGAGAG 
AGGAGAAGGA 
CTGAAGTCCG 
CCTGTTCAAG 
GTACCCCCTC 
ACCTAAAAGC 



AGCAGGACNG 
ACAGAGCCCC 
CCTGGACACT 
TGTCCTCCAG 
AGATACTTCT 
ACATGCTTTT 
ACATTCTAGC 
ACAACTGTTT 
GGCAACAGAA 
TTAAACTAAA 
AGACCCGAGA 
CCAAGGCCTA 



AGGGTGCCCG 
AGGTATGCTT 
GGCGGNGCCT 
ATCTGTCACT 
CCCAGCCACT 
CTAATTATGC 
AAAAGCAGGG 
GTTGTCCCCT 
GGACAATATG 
GGATTCCACC 
CCCAACAAGA 
GTAAAACCAA 



GGGCAAGCGC 
GACCATTGAG 
TCTCAGTCTT 
GTCCGAGGGG 
AAGTTGTGAC 
CTGAAAGCCC 
GCCATTATAC 
GCTTGAGGAA 
GACAAGCAAA 
TCCTTTCCCT 
ACTCCAAAAG 
GCAATAGCCC 



1650 

1700 

1750 

1800 

1850 

1900 

1950 

2000 

2050 

2100 

2150 

2200 

2250 

2300 

2304 



CAGCCCATGC 
GGTCAGAAGG 
ACTTTCCTGT 
TCCTAGGACA 
TGGGGAACTT 
CACTCTCTTG 
ATGTGAATAT 
GGAATTAATC 
GAATGCCCGT 
ACCAAAGGCA 
ATTGTAAAGG 
TTGCAAGACT 



50 

100 

150 

200 

250 

300 

350 

400 

450 

500 

550 

600 
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=«=i«=o« »co=«c»=. «»T«a«c, T»OT«»«» «o 

rrOCKTT .rC«.C«« CTC^OTTCC CCTOT^CCT. 

TccTTOT^C .=T00m=C =«»T«»« »=O"C»0. CTOC,TT«=» «0 

Tc^^CC TT»«.»TOC C,«T«=C ^TCCCTOT.C OTCCT^.CXC SOO 
3 rcTTCTTO T^CCTT^ MO^^TT ».=C=«CO TC.C»CT« 0 
C^CTOT m»=CCC«> OCCTTC^OO .T«K:CCC^ -™ 0 

"c^cc,.- ccc»«c« o.xc«™ ~= .™ 

„™W«C »TOG»TOATT T»CTTTT«=T CGCCCOTTC. OMXCCTTCT 
C.CCC«,=« CTCT^^C^ TCCTCCT.C =TC^=T.C 

10 '^^"'^'^ ■^'^^^ '"^^^ o^^^i^'^oc »=o 

^.CCT^WCr OOCTTMCCT C»TCCC«>« "^^""^ ^^^^CC 1»0 
— •i.^.roTrP CTOOCOWIM CMMTCCCa 0(!TllCR5C<->- 
™. cClTTiram CUCTMTTMl OOWUlOTCSG HWVOCCMTA 
15 CCTHITTMT «OATCO»<a CCTKC»=«0 TOCCTTTCn. 

««OCCCT« CCC«CCCCC .CTCTTC.CC TT.CC««0 00C«=^ 
C=C»C»C»« .«0.0.^T »CCTCT.«» OTC^T^ 
»0CTCTCA0C C«CCC0TGO TOWCCTOM t»GO»«TT 

rl^G c»«o..™ occc^o. - 

TTGT6GTTGT CAGACRACCR TTTR cCCAAACTTA 1750 

.^^r-i.^.o rCACTTGTCC AACTCTTRAA CCCAAAl-J.i« 
AGAGCCAGTG CTGNGACTGC GCACTTGT ccCTGACCTC IBOO 

TCCTGCCCAG AAGGATCTTT NTAGAGGTCC CCTTAGCCAA ^^^^^^^ 
...^^ :xAGTTCGTTT GTAGAAAA6G GATTACAAAG 
» '«'"^™"'""ro ^-«T O«.0T«0C »00 

°°""=Tc "c^rr rcLcc^T ™c«i«c. «l.oocc« 

CTCTTCCCCC CCAGGGACCA nTCTGTATAC 2000 

^=0^^— - 

.0««««0 «T0CTT»T0 T«TCOO«* T ^^^^ 

,0 ^■-^•^ -"'^'"^ ^'''ZZ Z^c^o. ^c.o.^-' "=0 

TOXTatoe AGIT.TTO» »«OTOCM «ACTCMOO 
^^„C C«»OCC.O .O^OOO. »0.0.0«. 

..^0.. ^o.=,«c. ^ ^^^^ 

35 CAGAGAGAGA GAGAGACAGA GAGTCAGAU ^^^^ 
GAGACAAAGA ATGAH 
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5 



(2) INFORMATION FOR SEQ ID NO: 95: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 768 amino acids 

(B) TYPE: peptide 

(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 95: 



SSSRTBGARG KCQPMPSPSE PRVCLTIESQ EVNCLLDTGA AFSVLLSCPR 
GVI.QPVTTV FSQP.SCOWO T..FS^..X HPESPTP..G 

roiLAKAGAI IHLN 



50 
100 
150 



KCO.»SP=- ""•^"■'^ 
a„SMVTI» OVLCQPVTT. rSWL=CO»0 TtLFSB«I,I HPESPTPLLO 

10 LuaCAOX: IH.»IOKC„ ICCPLLBBCI -PZ™.-™ V«!«»«<^ 

rVK.K.S«. PVQBKV™. ..I«.X,«V mX«».LW PCSSPO^PX 0° 
ZcP^O." «>V.PL,P»V SSPVT.,..! P..^ 

rr^PCtP V,-OS«P«» PHOP.»PTS0 L^TVLPOOP ° 

r»^»ps. — : 

15 ^K»Lsa .mLOLKLS KOTP.!^-" 

o"Xk=i p.«PX»- " 

l^S„ T«OP=L.^. .KX<=X«.V. T=V.=»S.«P V«^-X.V SO 
V^BC.» V««V»V.VS H.V.X,0=» ."™S»™ OX.T«OO.W 

r»»..Bv, ^.-=p-p ° 
rr^.^^ 

ADIEAKKAAR QDSPLEML 

25 (2) INFORMATION FOR SEQ ID NO: 96: 
(i) SEQUENCE CHARACTERISTICS. 

(A) LENGTH: 114 amino acids 

(B) TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



35 (2) INFORMATION FOR SEQ ID NO: 97: 
(i) SEQUENCE CHARACTERISTICS: 



t 
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(A) LENGTH: amino acids 

(B) TYPE: peptide 

,.1, SMOEHOB DESOMPTIO"' SM I» »0. 97. 

„m,»PFCIPVRPDSQPLP«..=P..»P-Sa'-™™'e='"'=™''°* 
■,»,..«.FSVL.™Mi™OLI^V«.S.TI,CHS.T<!«.LT,LTTC=«,.V 

,0 SKPKMLC«««VL="aSKOtP«.SEm»PILMPHPm«»«=.L 

.Xpvps.p.W.p-"™''-''"-"'-"'"""'''""" 

„.-rSE«PI»»QE«»UXL»V<,KPK.V«VX.CQ.«,..EE,«XBa»«, 
ADIEAKKAARQDSPLEML 

(2) INFORMATION FOR SEQ ID NO: 98: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: amino acids 

(B) TYPE: peptide 

as <.i, SES».»=E OESCEIPtlO". .K! ID >">' «■ 

^Cs«a*E.I^T,«.«=EO.KV»...0SKTM.VL„^«^".EK 

30 RDIEAKKAARQOSPLEML ' 

(2) INFORMATION FOR SEQ ID NO: 9^ 
(i) SEQUENCE CHARRCTERISTICSt 

(A) LENGTH: 23 base pairs | 

(B) TYPE: nucleotide 

(C) STRMJDEDNESS: ^ngle 
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(D) TOPOLOGY: linear 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
AGGAGTAAGG AAACCCAACG GAC 

5 (2) INFORMATION FOR SEQ ID NO: 100 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS! single 

(D) TOPOLOGY: linear 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
TAAGAGTTGC ACAAGTGCG 

(2) INFORMATION FOR SEQ ID NO: 101 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base paJ.rB 

(B) TYPE: nucleotide 

(C) STRANDEUNESS: single 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

TCAGGGATAG CCCCCATCTA T 

(2) INFORMATION FOR SEQ JD NO: 102 
(1) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTIf: 24 base pairs 

(B) TYPE/ nucleotide 

(C) STRAIfDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESmiPTIONs SEQ ID NO: 102 
30 AACCCTTTGC CACTACATCA ITTT ^ 

(2) INFORMATION FOR gfQ ID NO: 103 
(i) SEQUENCE CHARACTERISTICS: 

(A) LTNGTH: 18 base pairs 

(B) ITPE'- nucleotide 

(C) fTRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
AGCAGCAGGA CTGAGGGT 

5 (2) INFORMATION FOR SEQ ID NO: 104 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

( B ) TYPE : nucleot ide 

(C) STRANDEDNESS ! single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
CTGTCCGTTG GGTTTCCTTA CTCCT 

(2) INFORMATION FOR SEQ ID NO: 105 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

GACAGCAAAT GGGTATTCCT TTCC 

(2) INFORMATION FOR SEQ ID NO: 106 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106 
30 AGGAGTAAGG AAACCCAACG GACA 

(2) INFORMATION FOR SEQ ID NO: 107 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 



# 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
TGTATATAAT GGTCTGGCTA TT6GG 

5 (2) INFORMATION FOR SEQ ID NO: 108 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
TTCGGCAGAA ACCTGTTAT6 CCAAGG 

(2) INFORMATION FOR SEQ ID NO: 109 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (xl) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

GGCTCTGCTC ACAGGAGATT AGATAC 

(2) INFORMATION FOR SEQ ID NO: 110 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
30 AAAGGCACCA GGGCCCTCAG TGAGGA 

(2) INFORMATION FOR SEQ ID NO: 111 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: base pairs 
25 (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 



25 



26 



26 



26 
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(D) TOPOLOGY: linear 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

28 

GGTTTAAGAG TTGCACAAGT GCGCAGTC 

5 (2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
GCTTATAGAA GGACCCCTAG TATGGGGTAA TCCCCTCTGG GAAACCAAGC CCCAGTACTC 
A6CAGGAAAA ATAGAATAGG AAACCTCACA AGGACATACT TTCCTCCCCT CCAGATGGCT 120 
15 AGCCACTGAG GAAGGAAAAA TACTTTCACC TGCAGCTAAC CAACAGAAAT TACTTAAAAC 180 
CCTTCACCAA ACCTTCCACT TAGGCATTGA TAGCACCCAT CAGATGGCCA AATTATTATT 240 
TACTGGACCA GGCCTTTTCA AAACTATCAA GAAGATAGTC AGGGGCTGTG AAGTGTGCCA 300 

310 

AAGAAATAAT 

20 (2) INFORMATION FOR SEQ ID NO: 113: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Leu lie Glu Gly Pro Leu Val Trp Gly Asn Pro Leu Trp Glu Thr Lys 

pro Gin Tyr Ser Ala Gly Lys He Glu Xaa Glu Thr Ser Gin Gly His 
20 

Thr Phe Leu Pro Ser Arg Trp Leu Ala Thr Glu Glu Gly Lys He Leu 



30 



35 



40 



45 



ser pro Ala Ala Asn Gin Gin Lys Leu Lea Lys Thr Leu His Gin Thr 
Phe His Leu Gly He Asp Ser Thr His Gin Met Ala Lys Leu Leu Phe 
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65 

Thr Gly Pro Gly Leu Phe Lys Thr He Lys Lys He Val Arg Oly Cys 

85 90 '5 

Glu Val Cya Gin Arg Asn Asn 
5 100 

(2) INFORMATION FOR SEQ ID NO J 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 base pairs 
j^Q (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

15 CCCTGTATCT TTAACCTCCT TGTTAAGTTT GTCTCTTCCA GAATCAAAAC TGTAAAACTA 60 

CAAATTGTTC TTCAAATGGA GCACCAGATG GAGTCCATGA CTAAGATCCA CCGTGGACCC 120 

CTG6ACCGGC CTGCTAGCCC ATGCTCCGAT GTTAATGACA TTGAAGGCAC CCCTCCCGAG 180 

GAAATCTCAA CTGCACAACC CCTACTATGC CCCAATTCAG CGGGAAGCAG TTAGAGCGGT 240 

CATCAGCCAA CCTCCCCAAC AGCACTTGGG TTTTCCTGTT GAGAGGGGGG ACTGAGAGAC 300 

20 AGGACTAGCT GGATTTCCTA GGCCAACGAA GAATCCCTAA GCCTAGCTGG OAAGGTOACT 360 

GCATCCACCT CTAAACATGG GGCTTGCAAC TTAGCTCACA CCCGACCAAT CAGAGAGCTC 420 

ACTAAAATGC TAATTAOGCA AAAATAGGAG GTAAAGAAAT AGCCAATCAT CTATTOCCT6 480 

AGAGCACAGC GGGAGGGACA AGGATCGGGA TATAAACCCA GGCATTCGAG CCGGCAACGG 540 

CAACCCCCTT TGGGTCCCCT CCCTTTGTAT GGGCGCTCTG TTTTCACTCT ATTTCACTCT 600 

635 

25 ATTAAATCTT GCAACTGAAA AAAAAAAAAA AAAAA 

(2) INFORMATION FOR SEQ ID NO: 115: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 
3Q (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
35 Pro cys He Phe Asn Leu Leu Val Lys Phe Val Ser Ser Arg He Lys 

in 15 



1 



• 
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Thr Val Lys Leu Gin He Val Leu Gin Met Glu Hi^ Gin Met Glu Ser 



20 



25 



30 



Met Thr Lys He His Arg Gly Pro Leu Asp Arg Pro \j.a Ser Pro Cys 

35 *0 
ser Asp val Asn Asp lie Glu Gly Thr Pro Pro Glu GUji^ 

50 55 60 

Ala Gin Pro Leu Leu Cys Pro Asn Ser Ala Gly Ser Ser \ 



65 



70 



75 



10 (2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

* 32 
TGGGGTTCCA TTTGTAAGAC CATCTGTAGC TT 

20 (2) INFORMATION FOR SEQ ID NO: 117: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1481 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
.XOGCCCTCC CTTATCATAC TTTTCTCTTT ACTGTTCTCT TACCCCCTTT CGCTCTCACT 

CCACCCCCTC CATGCTGCTG TACAACCAGT AGCTCCCCTT ACCAAGAGTT TCTATGAAGA 20 

30 ACGCGGCTTC CTGGAAATAT TGATGCCCCA TCATATAGGA GTTTATCTAA GGGAAACTCC 0 

.CCTTCACTG CCCACACCCA TATGCCCCGC AACTGCTATA ACTCTOCCAC TCTTTOCATO 240 

CATGCAAATA CTCATTATTG GACAGGGAAA ATGATTAATC CTAGTTOTCC TGGAGGACTT 00 

GGAGCCACTG TCTGTTGGAC TTACTTCACC CATACCAGTA TGTCTOATOO GOOTGGAATT 360 

c«>«^o c»o«=«- ■^-"■^"=«= "° 

35 ».»CCCCT» accccT.c« »ao»CT»orr =tctc««c «c»to«»c =c«cot.cc «o 

„T«:TC=CC «CC. »T™C. »CCCT»CTC occrcCATC. .CTCTC«=CC »o 



60 



W098/237S5 
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CAAAACCCTA CTAACTGTTG GATGTGCCTC CCCCTGCACT TCAGGCCATA CATTTCAATC 600 
CCTGTTCOTG AACAATGGAA CAACTTCAGC ACAGAAATAA ACACCACTTC CGTTTTAGTA 660 
GGACCTCTTG TTTCCAATCT GGAAATAACC CATACCTCAA ACCTCACCTG TGTAAAATTT 720 
AGCAATACTA TAGACACAAC CAGCTCCCAA TGCATCAGGT GGGTAACACC TCCCACACGA 780 
5 ATAGTCTGCC TACCCTCAGO AATATTTTTT GTCTGTGOXA CCTCAGCCTA TCATTOTTTG 840 
AATGGCTCTT CAGAATCTAT GTGCTTCCTC TCATTCTTAG TGCCCCCTAT GACCATCTAC 900 
ACTGAACAAG ATTTATACAA TCATGTCGTA CCTAAGCCCC ACAACAAAAG AGTACCCATT 960 
CTTCCTTTTG TTATCAGAGC AGGAGTGCTA GGCAGACTAG GTACTGGCAT TGGCAGTATC 1020 
ACAACCTCTA CTCAGTTCTA CTACAAACTA TCTCAAGAAA TAAATGGTGA CATGGAACAG 1080 
10 GTCACTGACT CCCTGGTCAC CTTCCAAGAX CAACTTAACT CCCTAGCAGC AGTAGTCCTT U40 
CAAAATCGAA GAGCTTTAGA CTTGCTAACC GCCAAAAGAG GGGGAACCTG TTTATTTTTA 1200 
GGAGAAGAAC GCTGTTATTA TGTTAATCAA TCCAGAATTG TCACTGAGAA AGTTAAAGAA 1260 
ATTCGAGATC GAATACAATG TAGAGCAGAG GAGCTTCAAA ACACCGAACG CTGGGGCCTC 1320 
CTCAGCC^T GGATGCCCTG GGTTCTCCCC TTCTTAGGAC CTCTAGCAGC TCTAATATTG 1380 
15 TTACTCCTCT TTGGACCCTG TATCTTTAAC CTCCTTGTTA AGTTTGTCTC TTCCAGAATT 1440 
6AAGCT6TAA AGCTACAOAT G6TCTTACAA ATG6AACCCC A "^l 

(2) INFORMATION FOR SEQ ID NO: 118: 
(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 493 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY J linear 
(ti) TYPE DE MOLECULE: peptide 

25 (Xi) SEQOENCE DESCRIPTION: SEQ ID NO: 118: 

5 10 
Thr Ala Pro Pro Pro Cys Cys Cye Thr Thr Ser Ser Ser 



Phe Ala Leu 

20 25 



30 



pro Tyr Gin Glu Phe Leu Kaa Arg Thr Arg Leu Pro Gly Asn He Aep 

40 

50 

HLs Thr HiB Met Pro Arg Asn Cys Tyr Asn Ser Ala T.r Leu Cys Met 

70 

„I. »=n T». Hi. TV. T.r =lv «« «n Pro S» C 
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15 



20 



25 



30 



35 



186 



90 



95 



Pro 61y 



85 

Gly Leu Gly Ala Thr Val Cys Trp Thr Tyr Phe Thr Hie Thr 



100 



105 



110 



Ser Met Ser Asp 
115 



Gly Gly Gly He Gin Gly Gin Ala Arg Glu Lys Gin 



120 



125 



val LyB Glu Ala He Ser Gin Leu Thr Arg Gly Hie Ser Thr Pro Ser 
130 

pro Tyr Lye Gly Leu Val Leu Ser Lys Leu His Glu Thr Leu Arg Thr 
145 

His Thr Arg Leu Val Ser Leu Phe Asn Thr Thr Leu Thr Arg Leu His 

165 170 "5 

Glu val ser Ala Gin Asn Pro Thr Asn Cys Trp Met Cys Leu Pro Leu 
180 

195 

Phe ser Thr Glu He Asn Thr Thr Ser Val Leu Val Gly Pro Leu Val 

215 220 
ser Tn Leu Glu He Thr His Thr Ser Asn Leu Thr Cys Val Lys Phe 
225 230 235 240 



225 

Thr He ASP Thr Thr Ser Ser Gin Cy- He Arg Trp 
245 

Thr Arg He Val Cys Leu Pro Ser Gly He Phe Phe Val Cys 



265 270 



pro Pro 

260 

Ala Tyr His Cys Leu Asn Gly Ser Ser Glu Ser Met Cys 



Gly Thr Ser 
275 



280 



285 



Phe Leu ser Phe Leu Val Pro Pro Met Thr He Tyr Thr Glu Gin Asp 
290 

Leu pro Phe Val He Arg Ala Gly Val Leu Gly Arg Leu Gly Thr Gly 

325 330 
lie Gly ser He Thr Thr Ser Thr Gin Phe Tyr Tyr Lys Leu Ser Gin 

'>ACL 350 

340 -^^^ 
355 360 365 
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187 



370 



375 



380 



Ala Leu ASP Leu Leu Thr Ala Lys Arg Gly Gly Thr Cys Leu Phe Leu 



385 

Gly Glu 



390 



395 



400 



Glu Arg cys Tyr Tyr Val Asn Gin Ser Arg He Val Thr Glu 



405 



410 



415 



Lys Val Lys 



Glu lie Arg Asp Arg He Gin Cys Arg Ala Glu Glu Leu 



420 



425 



430 



Gin Asn Thr Glu Arg Trp Gly Leu Leu Ser Gin Trp Met Pro Trp Val 

435 440 445 

^eu pro Phe Leu Gly Pro Leu Ala Ala Leu He Leu Leu Leu Leu Phe 
450 

Gly pro cys He Phe Asn Leu Leu Val Lys Phe Val Ser Ser Arg He 



465 



470 



475 



480 



Glu Ala val Lys Leu Gin Met Val Leu Gin Met Glu Pro 



485 490 



(2) INFORMATION FOR SEQ ID NO: 119: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
20 (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
25 TCAAAATCGA AGAGCTTTAG ACTTGCTAAC C6 



32 



(2) INFORMATION FOR SEQ ID NO: 120: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 
2Q (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
35 TCAAAATCGA AGAGCTTTAG ACTTGCTAAC CCCCAAAAGA GGGGGAACCT GTTTATTTTT 60 




PCT/IB97/01482 
WO 98/23755 



188 



AATTTGAGAT CGAATATAAT GTAGAGCAGA GGACCTTCAA AACACTGCAC CCTGGGGCCT 180 
CCTCAGCCAA TGGATGCCCT GGACTCTCCC CTTCTTAGGA CCTCTAGCAG CTATAATATT 240 
TTTACTCCTC TTTG6ACCCT GTATCTTCAA CTTCCTTGTT AAGTTTGTCT CTTCCAGAAT 300 
TGAAGCTGTA AAGCTACAAA TAGTTCTTCA AATGGAACCC CAGATGCAGT CCATGACTAA 360 
5 AATCTACCGT GGACCCCTGG ACCGGCCTGC TAGACTATGC TCTGATGTTA ATOACATTOA 420 
RGTCACCCCT CCCGAGGAAA TCTCAACTGC ACAACCCCTA CTACACTCCA ATTCAGTAGG 480 
AAGCAGTTA6 AGCAGTT6TC AGCCAACCTC CCCAACAGTA CTTGGGTTTT CCT6TTGAGA 540 
OGGTGGACTG AGAGACAGGA CTAGCTGGAT TTCCTAGGCT GACTAA6AAT CCCNAAGCCT 600 
ANCTGG6AAG GTGACCGCAT CCATCTTTAA ACATGGG6CT TGCAACTTAG CTCACACCCG 660 
10 ACCAATCA6A GAGCTCACTA AAATGCTAAT CAGGCAAAAA CAGGAGGTAA AGCAATAGCC 720 
RATCATCTAT TGCCTGAGAG CACAGCGGGA AGGACAAGGA TTGGGATATA AACTCAGGCA 780 
TTCAAGCCAG CAACAGCAAC CCCCTTTGGG TCCCCTCCCA TTGTATGGGA GCTCTGTTTT 840 
CACTCTATTT CACTCTATTA AATCATGCAA CTGCACTCTT CTGGTCCGTG TTTTTTATGG 900 
CTCAAGCTGA GCTTTTGTTC GCCATCCACC ACTGCTGTTT GCCACCGTCA CAGACCCOCT 960 
15 GCTGACTTCC ATCCCTTTGG ATCCAGCAGA GTGTCCACTG TGCTCCTGAT CCAGCGAGGT 1020 
ACCCATTGCC ACTCCCGATC AGGCTAAAGG CTTGCCATTG TTCCTGCATG GCTAAGTGCC 1080 
TG6GTTTGTC CTAATAGAAC TGAACACTGG TCACTGGGTT CCATGGTTCT CTTCCATGAC 1140 
CCACGGCTTC TAATAGAGCT ATAACACTCA CCGCATGGCC CAAGATTCCA TTCCTTGGTA 1200 
TCTGTGAGGC CAAGAACCCC AGGTCAGAGA ANGTGAGGCT TGCCACCATT TGGGAAGTGG 1260 
20 CCCACTGCCA TTTTGGTAGC GGCCCACCAC CATCTTGGGA GCTGTGGGAG CAA60ATCCC 1320 

1329 

CCAGTAACA 

(2) INFORMATION FOR SEQ ID NO: 121: 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: peptide 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Gin Asn Arg Arg Ala Leu Asp Leu Leu Thr Ala Lys Arg Gly Gly Thr 

in 15 



35 



Cys Leu Phe Leu Gly Glu Glu Cys Cys Xaa Tyr Val Asn Gin Ser Gly 
20 

lie lie Thr Glu Lys Val Lys Glu He Xaa Asp Arg He Xaa Cys Arg 

45 



35 



• 
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Ala Glu ABp Leu Oln Asn Thr Ala Pro Trp Gly Leu Leu Ser Gin Trp 



55 60 



50 

Met Pro Trp Thr Leu Pro Phe Leu Gly Pro Leu Ala Ala He He Phe 
65 



70 75 80 



Leu Leu Leu Phe Gly Pro Cys He Phe Asn Phe Leu Val Lys Phe Val 

85 90 " 

ser ser Arg He Glu Ala Val Lya Leu Gin He Val Leu Gin Met Olu 

100 105 110 

pro Gin Het Gin Ser Met Thr Lys He Tyr Arg Gly Pro Leu Asp Arg 



120 125 



10 115 

pro Ala Arg Leu Cys Ser Asp Val Asn Asp He Glu Val Thr Pro Pro 

130 135 1*0 

Glu Glu He ser Thr Ala Gin Pro Leu Leu His Ser Aen Ser Val Gly 

145 



15 ser ser 

(2) INFORMATION FOR SEQ ID NO; 122; 
<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
2Q (B) TYPE: nucleotide 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE; ADNc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
25 GGCATTGATA GCACCCATCA G 

(2) INFORMATION FOR SEQ ID NO: 123: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
3Q (B) TYPE: nucleotide 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
35 CATGTCACCA GGGTGGAATA G 
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(2) INFORMATION FOR SEQ ID NO: 124: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 758 base pairs 

(B) TYPE: nucleotide 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
GGCATT6ATA GCACCCATCA GATGGCCAAA TCATTATTTA CTGGACCAGG CCTTTTCAAA 60 
10 ACTATCAAGC AGATAGGGCC CGTGAAGCAT GCCAAAGAAA TAATCCCCTG CCTTATCGCC 120 
ATGTTCCTTC AGGAGAACAA AGAACAGGCC ATTACCCAGG GGAAGACTGG CAACTAGATT 180 
TTACCCACAT GGCCAAATGT CAGGGATTTC AGCATCTACT AGTCTGGGCA GATACTTTCA 240 
CTGGTTGGGT GGAGTCTTCT CCTTGTAGGA CAGAAAAGAC CCAAGAGGTA ATAAAGGCAC 300 
TAATGAAATA ATTCCCAGAT TTGGACTTCC CCCAGGATTA CAGGGTGACA ATGGCCCCGC 360 
15 TTTCAAGCCT GCAGTAACCC AGGGAGTATC CCAGGTGTTA GGCATACAAT ATCACTTACA 420 
CTGTGCCTGG AGGCCACAAT CCTCCAGAAA AGTCAAGAAA ATGAATGAAA CACTCAAAGA 480 
TCTAAAAAAG CTAACCCAAG AAACCCACAT TGCATGACCT GTTCTGTTGC CTATAACCTT 540 
ACTAAGAATC CATAACTATC CCCCAAAAAG CAGGACTTAG CCCATACGAG ATGCTATATG 600 
CATGGCCTTT CCTAACCAAT GACCTTGTGC TTGACTGAGA AATGGCCAAC TTAGTTGCAG 660 
20 ACATCACCTC CTTAGCCAAA TATCAACAAG TTCTTAAAAC ATCACAGGGA ACCTGTCCCC 720 

GA6AGGAGGG AAAGGAACTA TTCCACCCTG GTGACATG 



(2) INFORMATION FOR SEQ ID NO: 126: 
25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
3Q ^i5_) TYPE DE MOLECULE: ADNc 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

^ 25 
CGGACATCCA AAGTGATGGG AAACG 

(2) INFORMATION FOR SEQ ID NO: 127: 
35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 
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5 



(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

26 

GGACAGGAAA GTAAGACTGA GAAGGC 



(2) INFORMATION FOR SEQ ID NO: 128: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CCTAGAACGT ATTCTGGAGA ATTGGG 

(2) INFORMATION FOR SEQ ID NO: 129: 
(i) SEQUENCE CHARACTERISTICS: 
2Q (A) LENGTH: 26 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 

25 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

TGGCTCTCAA TGGTCAAACA TACCCG 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

3Q (A) LENGTH: 1511 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 

-IS rxi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CCT.L»T »«=0.C0» ^TC.CC, »=.COCT« <^^^ 



t 
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ATTTATATTC TTCTGCAGTA CCGCCTGGCC ACAIVTATCCT CTTCAAGGGA GAGAAACCTG X20 
GCTTCCTOAG GGAAGTATAA ATTATAACAT CATCTTACAG CTAGACCTCT TCTGTAGAAA 180 
GGAGGGCAAA TGGAGTGAAG TGCCATATGT GCAAACTTTC TTTTCATTAA GAGACAACTC 240 
ACAATTATGT AAAAA6TGTG GTTTAXGCCC TACAGGAAGC CCTCAGA6TC CACCTCCCTA 300 
5 CCCCAOCGTC CCCTCCCCGA CTCCTTCCTC AACTAATAAG GACCCCCCTT TAACCCAAAC 360 
6GTCCAAAAG GAGATAfiACA AAGGGGTAAA CAATGAACCA AA6A6T6CCA ATATTCCCCG 420 
ATTATGCCCC CTCCAA6CA0 TGAGAGGAGG AGAATTCGGC CCA6CCA6AG T0CCT6TACC 480 
TTTTTCTCTC TCAGACTTAA AGCAAATTAA AATAGACCTA 6GTAAATTCT CAGATAACCC 540 
TGACGGCTAT ATTGATGTTT TACAAGGGTT AGGACAATCC TTTGATCTGA CATGGAGAGA 600 
10 TATAATGTTA CTACTAAATC AGACACTAAC CCCAAATGAG AGAAGTGCCG CTGTAACTGC 660 
AGCCCGA6AG TTTGGCGATC TTTGGTATCT CAGTCAGGCC AACAATAGGA TGACAACAGA 720 
GGAAAGAACA ACTCCCACAG GCCAGCAGGC AGTTCCCAGT GTA6ACCCTC ATTG6GACAC 780 
AGAATCAOAA CATG6AGATT GGTGCCACAA ACATTTGCTA ACTTGCGTGC TAGAA66ACT 840 
GAGGAAAACT AGGAAGAAGC CTATGAATTA CTCAATGATG TCCACTATAA CACAGGGAAA 900 
15 GGAA6AAAAT CTTACT6CTT TTCTGGACAG ACTAAGGGAG 6CATTGAGGA AGCATACCTC 960 
CCTGTCACCT OACTCTATTG AAGGCCAACT AATCTTAAAO OATAAGTTTA TCACTCAGTC 1020 
AGCTGCAGAC ATTAGAAAAA ACTTCAAAAG TCTGCCTTAG GCCCGGAGCA GAACTTAGAA 1080 
ACCCTATTTA ACTTGGCATC CTCAGTTTTT TATAATAGAG ATCAGGAGGA GCAGGCGAAA 1140 
CGGGACAAAC GGGATAAAAA AAAAAGGGGG GGTCCACTAC TTTAGTCATG GCCCTCAG6C 1200 
20 AAGCAGACTT TGGAGGCTCT GCAAAAGGGA AAAGCTGGGC AAATCAAATG CCTAATAGGO 1260 
CTGGCTTCCA GTGCGGTCTA CAAGGACACT TTAAAAAAGA TTATCCAAGT AGAAATAAGC 1320 
CGCCCCCTTG TCCATGCCCC TTACGTCAAG GGAATCACTG GAAGOCCCAC TGCCCCAOGG 1380 
GAT6AAGATA CTCT6AGTCA GAAGCCATTA ACCAGATGAT CCAGCAGCAG GACTGAGGGT 1440 
GCCCGGGGCG A6CGCCAGCC CATGCCATCA CCCTCACAGA GCCCCGGGTA TGTTTGACCA 1500 

1511 

25 TTGAGAGCCA A 

(2) INFORMATION FOR SEQ ID NO: 131: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 352 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY; linear 
(ii> TYPE DE MOLECULE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
35 Leu Glu Arg He Leu Glu Asn Trp Asp Gin Cys Asp Thr Gin Thr Leu 

1 5 10 " 




PCT/IB97/01482 
WO 98/23755 



10 
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20 



25 



30 
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Arg Lya Lya Arg Phe He Phe Phe Cye Ser Thr Ala Trp Pro Gin Tyr 

20 25 30 

Pro Leu Gin Gly Arg Glu Thr Trp Leu Pro Glu Gly Ser He Asn Tyr 

35 40 45 

ABn He He Leu Gin Leu Asp Leu Phe Cys Arg Lye Glu Gly Lye Trp 

SO 55 60 

ser Glu Val Pro Tyr Val Gin Thr Phe Phe Ser Leu Arg Asp Asn Ser 
65 70 75 80 

Gin Leu cys Lys Lys Cys Gly Leu Cys Pro Thr Gly Ser Pro Gin Ser 

85 90 
Pro Pro Pro Tyr Pro Ser Val Pro Ser Pro Thr Pro Ser Ser Thr Asn 

100 lOS HO 

Lys Asp Pro Pro Leu Thr Gin Thr Val Gin Lya Glu He Asp Lys Gly 

115 120 125 

val Asn Asn Glu Pro Lys Ser Ala Asn He Pro Arg Leu Cys Pro Leu 

130 140 
Gin Ala val Arg Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val Pro 
145 150 155 160 

Phe ser Leu Ser Asp Leu Lys Gin He Lys He Asp Leu Gly Lys Phe 

165 170 "5 

ser ASP Asn Pro Asp Gly Tyr He Asp Val Leu Gin Gly Leu Gly Gin 

180 185 
ser Phe Asp Leu Thr Trp Arg Asp He Met Leu Leu Leu Asn Gin Thr 

195 200 205 

Leu Thr Pro Asn Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe 

210 215 220 

Gly ASP Leu Trp Tyr Leu Ser Gin Ala Asn Asn Arg Met Thr Thr Glu 
225 230 235 240 

Glu Arg Thr Thr Pro Thr Gly Gin Gin Ala Val Pro Ser Val Asp Pro 

245 250 255 

His Trp Asp Thr Glu ser Glu His Gly Asp Trp Cys His Lys His Leu 

260 265 270 

Leu Thr cys Val Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro Met 
275 280 285 

35 Asn Tyr Ser Met Met Ser Thr He Thr Gin Gly Lys Glu Glu Asn Leu 



290 



295 



300 
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Thr Ala Phe Leu Asp Arg Leu Arg Glu Ala Leu Arg Lys His Thr Ser 

315 320 

305 

Leu ser Pro Asp Ser He Glu Gly Gin Leu He Leu Lys Asp Lys Phe 



325 



330 335 



5 He Thr Gin Ser Ala Ala Asp He Arg Lys Asn Phe Lys Ser Leu Pro 

340 345 350 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

30 

T6CT6GAATT C6G6ATCCTA GAACGTATTC 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 30 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

30 

AGTTCTGCTC CGAAGCTTAG GCAGACTTTT 

(2) INFORMATION FOR SEQ ID NO: 135: 
(i) SEQUENCE CHARACTERISTICS: 
3Q (A) LENGTH: 398 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: peptide 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

Met Gly ser Ser His His His His His His Ser Ser Gly Leu Val Pro 
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10 



15 



20 



25 



30 



35 



10 



15 



U OXy ser His Met Ma Ser Met Thr OXy GXy Gin CXn Met Oly 

20 

XX« I.U Olu .rg lie Leu CIu Asn Trp Asp Gin Cys Asp Thr Gin Thr 

40 

.eu .r, Zs Lye Ar, Phe lie Phe Phe Cys Ser Thr Ma Trp Pro Gin 
50 

r,r pro Gin Gly .rg Gl« Thr Trp Leu Pro CIu Gly Ser lie ^sn 

70 '^^ 
Zr Ue lie Leu Gin Leu Ksp Leu Phe Cys Arg Lys GIu Gly Lys 



85 



val Gin Thr Phe Phe Ser Leu Arg Asp Asn 

110 



Trp Ser Glu Val Pro Tyr 
100 

Lys Lys cys Gly Leu Cys Pro Thr Gly Ser Pro Gin 



Ser Gin Leu Cys 
115 



120 



125 



130 



135 



140 



Asn Lys Asp Pro Pro Leu 



Thr Gin Thr Val Gin Lys Glu He Asp Lys 



145 

Gly Val Asn Asn 



150 



155 



160 



Glu pro Lys Ser Ala Asn He Pro Arg Leu Cys Pro 



165 



170 



175 



180 

P« Phe ser Leu Ser Asp Leu Lys Gin He Lys He Asp Leu Gly Lys 
195 

Oin ser Phe Asp Leu Thr Trp Arg Asp He Met Leu Leu Leu Asn Gin 
230 235 240 

„i i.r.r, sar Ala Ala Val Thr Ala Ala Arg Glu 
Thr Leu Thr Pro Asn Olu Arg Ser Ala Ai.a 

245 250 255 

260 265 270 

Glu Arg Thr Thr Pro Thr Gly Gin Gin Ala Val Pro Ser Val Asp 



Glu 



275 



280 



285 



pro His Trp Asp Thr Glu Ser 



Glu His Gly Asp Trp Cys His Lys His 
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290 



295 



300 



X.e« Leu Thr Cys Val Leu Gl« Gly Leu Arg Lys Thr Arg Lys Lys Pro 

315 320 
305 310 315 

325 330 335 

I.eu Thr Ala Phe Leu Asp Arg Leu Arg Glu Ala Leu Arg Lye Hie Thr 
340 

ser Leu Ser Pro Asp Ser He Glu Gly Gin Leu He Leu Lys Asp Lye 



355 



360 



365 



10 



Phe lie Thr Gin Ser Ala Ala Asp He Arg Lys Asn Phe Lys Ser Leu 



370 



375 



380 



pro Lys Leu Ala Ala Ala Leu Glu His His His His Hie His 



385 



390 



395 



15 (2) INFORMATION FOR SEQ ID NO: 137: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(11) TYPE DE MOLECULE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

Met Ala ser Met Thr Gly Gly Gin Gin Met Gly Arg He Leu Glu Arg 
5 10 ^5 

Glu Aen Trp Asp Gin Cys Asp Thr Gin Thr Leu Arg Lys Lys 



95 30 
20 25 



25 He Leu 

Arg Phe He Phe Phe Cys Ser Thr Ala Trp Pro Gin Tyr Pro Leu Gin 
35 

Gly Arg Glu Thr Trp Leu Pro Glu Gly Ser He Asn Tyr Asn He He 

cc 60 

30 50 

•7C 80 

70 ' 
pro Tyr Val Gin Thr Phe Phe Ser Leu Arg Asp Asn Ser Gin Leu Cys 
85 9° 

^ . ^ . r\^^ Tit^i^ nv*/-! 

35 Lys Lys Cys Gly Leu Cys Pro Thr Gly Ser 



100 



105 



110 
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pro ser Val Pro ser Pro Thr Pro ser ser Thr Asn Lys Asp Pro 



10 



20 



25 



30 



35 



Tyr 



120 



125 



P.O Lau Thr Gin Thr Val Gin I-ys Olu He Abp Lys Gly Val Asn Asn 
X30 

Olu pro Lys ser Ala Asn He Pro Arg Leu Cys Pro Leu Oln Ala Val 

160 



150 



155 



145 

Gly Gly Glu 

165 

He Lys He Asp Leu Gly Lys Phe 

190 

Gly Tyr He Asp Val Leu Gin Gly Leu Gly Gin Ser Phe Asp 



^rg Gly Gly Glu Phe Gly Pro Ala Arg Val Pro Val Pro Phe Ser Leu 

165 170 
ser ASP Leu Lys Gin He Lys He Asp Leu Gly Lys Phe Ser Aep Asn 



180 



Pro Asp 



200 



205 



Leu Thr Trp Arg Asp He Met Leu Leu Leu Asn Gin Thr Leu Thr Pro 

15 Asn Glu Arg Ser Ala Ala Val Thr Ala Ala Arg Glu Phe Gly Asp Leu 

230 235 240 

225 230 

Trp Tyr Leu Ser Gin Ala Asn Asn Arg Met Thr Thr Glu Glu Arg Thr 



245 



250 



255 



Thr pro Thr Gly Gin Gin Ala Val Pro Ser Val Asp Pro His Trp Asp 



260 



265 



270 



Thr Gl 



u ser Glu His Gly Asp Trp Cys His Lys His Leu Leu Thr Cys 



275 



280 



285 



val Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Pro Met Asn Tyr Ser 



290 



295 



300 



Met Met ser Thr He Thr Gin Gly Lys Glu Glu Asn Leu Thr Ala Phe 

310 315 320 

305 310 

Leu ASP Arg Leu Arg Glu Ala Leu Arg Lys His Thr Ser Leu Ser Pro 

325 330 335 

ASP ser He Glu Gly Gin Leu He Leu Lys Asp Lys Phe He Thr Gin 
340 

ser Ala Ala Asp He Arg Lys Asn Phe Lys Ser Leu Pro Lys Leu Ala 
355 



Ala 



Ala Leu Glu His His His His His His 
370 375 



(2) INFORMATION FOR SEQ ID NO: 138: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
5 (D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
CTTGGAGGGT GCATAACCAG GGAAT 

10 (2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
3^5 (D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
TGTCCGCTGT GCTCCTGATC 

20 (2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140 
CTATGTCCTT TTGGACTGTT TGGGT 

30 (2) INFORMATION FOR SEQ ID NO: 141: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

..xIcL acTCCTO.Tc ccccocc CCCC.T.CCC Tcxccc^- c™o 

IT^L TTCCTOC^C. GCT^^aTOCC CTAAXCO.CC TO,U.C*CT.a 

"Z^Z ™cx c™c cc« x^x.o.ocT 

rcLoOXC C^CXXCC XXCCXXOO^ TCCOXO.C.C c^c^cccc .coxc^o^A 
rdloccx xacc.cc.xa xxco..cc.o ccc.cc.cc. xxxxcccc .occcocc.c 
::^rXOCO. CCXCXOOCO COO.CCCC .OOT-CT xxocxo.cc. co^^o^ 
XCrTCCCC .CC.TO..Oa CXCXCC. GOUVXTOC .TOXTCCXCC cooc^ 
rcCCCX.. C.X0X.XXCX CO.O..XX00 O.CC..XXXO .CCCXC.O.C .0X^0^ 
ZxG.CTX. X.XXCTXCXO C.OT.CCCCC CTOCCC.CO. X.XCCTCXXC ..GOG0O.G. 

CCXO.OOC.. GX.X...XX. XX.C.OCX.G .ccxg™ 

"g..;U.CO. OCC...XOG. GXO..OXGCC .X.XXX.C .CXXXCXXXX C.XX....O. 
lirGC XX.XGXX..C .GXGXG.XXX OXOXXCCX.C .CGCGCCC XC.G.XXCX. 
CXCCCC.CCC CCGGCXCXC CCCTGW.XCC CXCCCCWVCT X.XX 



15 



20 



(2) INPORMAXION FOR SEQ ID NO: 142: 
(i) SEQUENCE CHARACXERISTICS: 

(A) LENGIH: 800 base pairs 

(B) XYPEz nucleotide 

(C) SXR.NDEDNESS : single 

(D) lOPOLOGY: linear 
(ti) TYPE DB MOLECULE: .DNC 
(xU SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

CCTCCT^TC C»=««==C OCCC.T.OCC .C,CCC«.T 00~ 

,5 Lc««c ™cToc»c. ccT«0T0cc TOoarrcMC =T«TCC»=C T=««CT« 

^OTX CC»C«=TT=T CC.TOOCTTC ,»T.=«.C =0 

ZZcOrC 0^.^0C. TTCC™0« TCCCTC^KC C«C«CCCC ^CTCXO^ 
rrCOCT TOCCCC^ ™O«>=C.0 CC»OC»C» TT«0»OC OCCCCCCC. 
rcT^CC. C«=O.C=C= C»=.T»C« TT^TO.CC 

====== : 

======= 



60 
120 
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800 



AACAGTCCAA AAGGACATAG 



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:169: 

« ^A^ + 42/68-1 + cl43 68-1) 

consensus (41/ob-i t^/^o 

(2) INFORMATION FOR SEQ ID NO: 170: 
(i) SEQUENCE CHARACTERISTICS: 
^5 (A) LENGTH: 438 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNC 

20 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

GACTTGAGCC AGTCCTCATA CCTGGACACT CTTGTCCTTC GGTACATGGA TGATTTACTT 60 

TTAGCCACCC ATTCAGAAAC CTTGTGCCAT CAAGCCACCC AAGCACTCTT AAATTTCCTT 120 

CCTACCTGTG GCTACAAGGT TTCCAAACCA AAGGCTCAGC TCTCCTCACA CCAGGTTAAA 180 

TACTTAGGGC TAAAATTATC CAAAGGCACC AGAACCCTCA GTOAGGAACG TATCCAGCCT 240 

25 ATACTGGGTT ATCCTCATCC CAAAACCCTA AAGCAACTAA CAGCGTTCCT TGGCATAACA 300 

OGTTTCTOCC AAATATGGAT TCCCAGGTAC AGCAAGATAG CCAGACCATT AAATACACGA 360 

ATTAAGGAAA CTCAAAAAGC CAATACCCAT TTAGTAAGAT GGACACCTGA AGCAGAAGTG 420 
GCTTTCCAGG CCCTAAAG 

30 (2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 



438 
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(Xt) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

O^CtLgCC AOTCCTCT. CCTGCCACT CTTOTCCXTC TG^TTTACTT 60 

""icCC .TTCa^C CTXOTOCC.T C^OCCCCC ^OCCXCXT .^TXXCCXX 0 

aCTACCTGTG GCTACAAGGT TTCCAAACCA AAGGCTCAGC TCTGCTCACA GCAGGTTAAA 80 

X^VCTTAGCGC TAAAATTATC CAA.GGCACC AGAACCCTCA GTGKGGAACG TATCCAGCCT 240 

;.TACTGGGTT ATCCTCATCC CAAAACCCTA AAGCAACTAA CAGCGTTCCT TGGCATAACA 300 

GGTTTCTGCC AAATATGGAT TCCCAGGTAC AGCAAAGTAG CCAGACCATT AAATACACGA 360 

.TTAAGGAAA CTCAAAAAGC CAGTACCCAT TTAGTAAGAT GGACACCTGA AGCAGAAGTG 400 
GCTTTCCAGG CCCTAAAG 



10 



60 



(2) INFORMATION FOR SEQ ID NO: 172: 
(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 438 base pairs 
(B) TYPE: nucleotide 
^5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 
GACTTGAGCC ACTCYTCATA CCTGGACAYT CTTGTCCTTC GGTACATGGA TGATTTACTT 
20 TTAGCCACCC ATTCAGAAAC CTTGTGCCAT CAAGCCACCC AAGCACTCTT AAATTTCCTT 120 
GCTACCTGTG GCTACAAGGT TTCCAAACCA AAGGCTCAGC TCTGCTCACA GCAGGTTAAA 180 
TACTTAGGGC TAAAATTATC CAAAGGCACC AGAACCCTCA GTGAGGAACG TATCCAGCCT 240 
ATACTGGGTT ATCCTCATCC CAAAACCCTA AAGCAACTAA CAGCGTTCCT TGGCATAACA 300 
GGTTTCTGCC AAATATGGAT TCCCAGGTAC AGCAAAATAG CCAGACCATT AAATACACGA 360 
25 ATTAAGGAAA CTCAAAAAGC CAATACCCAT TTAGTAAGAT GGACATCTGA AGCAGAAGTG 400 

438 

GCTTTCCAGG CCCTAAAG 

(2) INFORMATION FOR SEQ ID NO: 173: 
(1) SEQUENCE CHARACTERISTICS: 
3Q (A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: peptide 

35 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 173: 

DLSQSSYLDT LVLRYMDDLL LATHSETLCH QATQALLNFL ATCGYKVSKP 50 
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KAQLCSQQVK YLGLKLSKGT RTLSEERIQP ILGYPHPKTL KQLTAFLGIT 
GFCQIWIPM SKIARPLNTR IKETQKANTH LVRWTPEAEV AFQALK 

(2) INFORMATION FOR SEQ ID NO: 174: 
5 (t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) "^^^^ °^ MOLECULE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:174: 
OLSQSSYLDT LVLRYMDDLL LATHSETLCH QATQALLNFL ATCGYKVSKP 
KAQLCSQQVK YLGLKLSKGT RTLSEERIQP ILGYPHPKTL KQLTAFLGIT 
GFCQIWIPRY SKVARPLNTR IKETQKASTH LVRWTPEAEV APQALK 

15 

(2) INFORMATION FOR SEQ ID NO: 175: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 
DLSQSSYLDX LVLRYMDDLL LATHSETLCH QATQALLNFL ATCGYKVSKP 
25 KAQLCSQQVK YLGLKLSKGT RTLSEERIQP ILGYPHPKTL KQLTAFLGIT 
GFCQIWIPRY SKIARPLNTR IKETQKANTH LVRWTSEAEV AFQALK 

(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 

35 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

consensus ( l/46-7+8/46-7+cl5/46/7) 



100 
146 



50 
100 
146 



50 
100 
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(2) INFORMATION FOR SEQ ID NO« 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs 
5 (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOIiOcy: linear 

(ii) TYPE DE MOLECULE: ADNc 

ixt) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 
,0 OACtLIcC ACTCCTCATA CCTOOACATT CTTOTTCTTC AOTATOCOOA TOACTTAATT 0 

ircCACCC ATTCAOAAAC CTTOTCaCAT CAACCCACCC AAOCCCTCTT AAATTTCCT 
rACCXCTC OCTCCAAACA AAACGCTCAC CTCTCCTCAC ACCAOOTTAA ATACTTAGOO 
C^TTAT CCAAAGTCAC CAGGGCCCTC AGAGACCAAC GXATCCAGCG TATACTGGCT 4 
™tc CCATAACCCT AAAGCAACTA AGAGOGTTCC TTGGCATATC AGCCTTCTGC 00 
,5 rrTATOGA TTCCCGGATA CAGTOAAATA GCCAGGCCAT TATGTACATT aattaacoaa 

TcrGAAAG CCAATACCCA TATAGTAAGA TGGACACCTG AAACAGAAGT GGCTTTCCAG 420 
GCCCTAAAG 

(2) INFORMATION FOR SEQ ID NO: 178: 
20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
25 (ii) TYPE DE MOLECULE: ADNc 

(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 178: 
aAcrLocc AGTCCTCATA CCTCGACATT CTTGTTCTTC AGTATAGGGA TGATTTAATT 0 
Tgccccc ATTCAGAAAC CTTGTGGCAT CAAGCCACCC AAGTGCTCTT AAATTTCCTC 
rACCTCTG GCTCCAAACA AAGGGCTCAG CTCTGCTCAC AOCAGGXTAA ATACTTAGGG 0 
30 irALTTAT CCAAAGTCGC CAGGGCCCTC AGAGAGGAAC GTATCCAGCG TATACTGCAT 0 
™TC CCAAAACCAT AAAGCAACTA AGAGGGTTCC TTGGCATAAC AGCCTTCTGC 
icrATGGA TTCCCCGATA CAGTGAAATA GCCAGGCCAT TATGTACATT AGTTAAGGAA 
::rAGAAAG CCAATACCCA TATAGTAAGA TGGACACCTG AGACAGAAGT GGCTTTCCAG 420 
GCCCTAAAG 

35 

(2) INFORMATION FOR SEQ ID NO: 179: 
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(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 429 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNBSS: single 
5 (D) TOPOIiOGY: linear 

(ii) TYPE DE MOLECULE: ADNc 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:179: 

^GCAACTA AGARGGTTCC TTGGCATAAC AGCCTTCTGC 300 

'^Z^ ™T.0T«C. TOCKCCCTO -TTTCC«= «0 

15 GCCCTAAAG 

(2) INFORMATION FOR SEQ ID NO: 180: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 
2Q (B) TYPE: amino acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 
25 DLSQSSYLDI LVLQYGDDLI lATHSETLWH QATQALLNFL ATCGSKQKAH 
.CSHQVKYLC LKLSKVTRAL REERIQRILA YPHPITLKQL ROFLGISAFC 
RIWIPGYSEI ARPLCTLIKE TQKANTHIVR WTPETEVAFQ ALK 

(2) INFORMATION FOR SEQ ID NO: 181: 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 
35 (ii) TYPE DE MOLECULE: peptide 

(Xl) SEQUENCE DESCRIPTION: SEQ ID N0:181: 



100 
143 



t 
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„„„^™ OATQVLLNFL KTCGSKQBAQ - ^0 

5 .2, INFORMATION FOK SEQ ID NO: X82: 
a) SEQUENCE CHARACTERISTICS. 

(A) length: 143 amino actdB 

(B) TXPEt amino acid 

(C) stRRNDEDNESS: single 

(D) TOPOLOGY: linear 
«5.i> TYPE DE MOLECULE-, peptide 

™«»»v. «.K 

^2) INFORMATION FOR SEQ ID NO: 183: 
^i) SEQUENCE characteristics: 
<A) LENGTH: 25 base pair a 
2Q (B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
<li) TIPE DB MOLECULE: ADNc 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183; 

25 G6CCAGGCAT CAGCCCXAGA CTTGA 

(2) INFORMXTION FOR SEQ ID NO: 184: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO:184: 

35 TGCAAGCTCA TCCCTSRGAC CT 



50 
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(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) TYPE DE MOLBCtJLE: ADNc 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185; 
GACTTGAGCC AGTCCTCATA CCT 



10 



(2) INFORMATION FOR SEQ ID NO: 186: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) TYPE DE MOLECULE: ADNc 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 186; 
CTTTAGGGCC TGGAAAGCCA CT 
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TABLE NO. 5 
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CIAIMS 

^-^4=1 in the isolated or 
1. Nucleic material, m 

a nucleotide sequence selected 
purified state, comprising a rn no-93, SEQ ID 

5 from the group including sequences ^^^/^ J^^ ' J,,^, 
NO- 94 their complementary sequences and ^^^"^ ^^'^^^^^^ 
seque;ces, in particular nucleotide -^-nces /xsplaying 
tTr any succession of 100 contiguous monomers, at least 
for any homology with said 

50% and preferably at least ou-5 

^EO ID NO:93, SEQ ID NO:94 and their 

^ Pduivalent sequences, in parT:xcuxaj. 

iR cseauences and their equxvcij-cnu ^ . * 

15 sequence ^^e^ip^vina for any succession of 100 

nucleotide sequences displaying, ror a y 

=.4- ioa«5t 70% and preferably at lease 

purified state, ceding tor any polypeptide aisplayxn, for 

^ nf at least 30 amino acids, ar 

anw f-ontiauous succession of a-c xea&v. 

any contiguo ^^^^ preferably 

least 50%, preferably at leasi: ou , ^ k.7 

Tt least 70% ho.olo,y with a peptide se^ence ^ 
:^y nucleotide se<r>ence .elected £ro. the group including 
Z ID HO=93, SEQ ID NO=94 and their complementary 



25 

SEQ 



sequence.^ Hucleic material, in the isolated or 
'state ot retroviral type, comprising a 
,0 ^1 UdeC^nce identical or eguiv.lent to at le.»t 
part Of the pol gene of an isolated retrovirus associated 
'ith multiple sclerosis or rheumatoid arthr.txs. 

5 »ucl.lc material as claimed in claim 4, 
said nucleotide sequence being 80 * homologous to said at 
35 least part of the pol gene. 
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6. 



e. Nucleic material co^prlstn, a "-^-^^^ 
.e^.„c. identical or equivalent to at least part ot^ 
rJl aene o£ an isolated virus encoding a reverse 
fr nsIrXtase co.prisin, an en.y^atic site c^pn.^ 

^tween tne a»ino acid ^-•^"J "'^,^!,T'o. « ± 0.1, 
having a phylogenic distance with KSEKV 

end pre.erahlv _,.,3in, a nucleotide 

seguence Selected fro. the group including SE« ID H0=,3 
seI id «o: 94, their complementary seguences and their 
SEQ ID NO. ^articular nucleotide seguences 

rraytr. r ::; success"; o. lOO contiguous mongers, 
: Tast ^0* and 'preferahly at least 60* hcologv ^ 
.aid seguences and their compleBentary seguences, said 
, tp eluding SBQ 10 N0=1, and said 

not comprising nor consisting of the "■'--^"^^ 

8 Nucleotide fragment of claim 7 , nucleotide 
eeguenc. of which is selected from the group including SEQ 
'r^o .3, SEO ID ho: 94, their complementary seguences and 
0 their eguivalent seguences, in particular nuc eotide 
„i.„l.ving for any succession of 100 contiguous 
sequences displaying, lo^^ / ,^^^4. qoS: 

i=a.t 70% and preferably at least 80S 
monomers, at least an p complementary 

homology with said sequences and their co p 

seguences ^^^^^^^.^^ ,^,^„,„, comprising a coding 

nucleotide- seguence which is at -^/^^Htrup 
to a nucleotide seguence selected from the group 

including =^ „ K0=,3, 10 NO:94, their complementary 

30 seguences ; their equivalent seguences, in particular 
homologous to SEQ ID N0.3 SEQ ID N0=,4; 

sequences encoding at leasi: ya^ 

sequence defined by SEQ ID ^ 

sequences encoding at least part ot a P P 

35 seguence equivalent, in particular " /^^^ 

N0:«, Which is capable of being recognized by 



t 
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p.tient. inccted .ith the HSRV-l virus, or in wh» the 
„3.v-i vir. ... . a 

5 arthritis, characterized in that it is cap 

:rbriaizing specifically with any fragment according to 

any one ^^^^^^^^^^^ XO, consisting of 

between 10 and 1,000 monomers. , .^.^.^-Hon bv 

12 primer for the amplification by 

^° ' r,M^ r.y a DNA of 3 vlral material 

polymerization of an or a DHA of 

associated «ith multiple solerosis o 
arthritis, ooBprisin, a nucleotide sequence identical or 
e^ivalen; to at least one portion of the nucle^ide 
sequence of a fragment as claimed in any one of claims 7 
to 9 m particular a nucleotide sequence displaying, for 
any 'succession of at least 10 contiguous monomers 
oreferably 15 contiguous monomers, more preferably 18 
rntCois monomers and even most preferably 
.0 monomers, at least 70» homology with at least the said 
Dortion of the said fragment. 

13 primer as claimed in Claim 12, comprising a 
sequence selected from the group consisting of SEQ ID NO: 

99 to SEQ ID NO: 111- 

14 polypeptide encoded by any open reading 

frame belonging to a nucleotide fragment as claimed in any 

one of claims 7 to 9. ^ .waracterized in 

15, 



30 



35 



1. polypeptide of claim 14, characterized 
that the open reading frame encoding it, is comprised, in 
fhe 5 -3- 'irection, between nucleotide 18 and nucleotide 

2304 of SEQ ID 110:93. 

16 polypeptide according to claim 15, 
comprising 'a peptide sequence at least partially identical 
SEC Z.«0: , ,,,,,ae sequence 

at least partially Identical to SEQ ID NO: 96. 



f 
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^'A^ claim 17 exhibiting an 

18. Polypeptide of claim i/ 

20. polypeptide exhibiting an inhibitory 
activity on the proteolytic activity of polypeptxde of 

"'21. polypeptide, comprising a peptide sequence 
identical or equivalent to SEQ ID NO: 97. 

22 polypeptide of claim 21, comprxsing a 
peptide sequence identical or equivalent to SEQ ID NO: 98 
^ 23 polypeptide, characterized in that the open 

reading frame encoding it begins, in the 5-3. ^^i--^^-' 
at nucleotide 341 and ends at nucleotide 2304 of SKQ ID 

24 polypeptide, characterized in that the open 
reading frame encoding it begins, in the 5.-3. direction 
at nucleotide 1858 and ends at nucleotide 2304 of SEQ ID 



15 

NO:93. 



20 

NO: 93 . 

25. 



polypeptide of claim 21 or 23. exhibiting a 

reverse transcriptase activity. 

26. polypeptide of claim 22 or 24, exhibiting a 

25 ribonuclease H activity. *„hihitQrv 

27 polypeptide exhibiting an inhibitory 
activity on the reverse transcriptase activity of 

Dolvpeptide of claim 25. 4..„4 4.„ 

28 polypeptide having an inhibitory activity 
30 on the ribonuclease H activity of polypeptide of claim 26^ 

29 Antigenic polypeptide recognized from the 
sera of patients infected with the HSHV-1 ^i-- -^^^^^^^^^ 
„nom the MSRV-1 virus has been ^^^^f ^ly 
4n that its peptide sequence is at least partially 

. ITll or is equivalent to a sequence selected from the 
35 identical or is equi fragments thereof, 

group consisting of SEQ id wo.^*^, 
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10 



15 



in p.rticul« SEO XO N0:«, SBQ ID H0,.7 ana SKO 10 K0 = 

30. Mono- or polyclonal 'f^^^J'^^X 
. 4. MSRV-1 virus, characterized in that 

rr^nld r UunoloWcal reaction a .u.,n or 

anlrxlcay or cXXs .o an i«uno,eni= a.ent consist.n, o. 
an an.X,.nic P»^VP-«^;- .^.^on o. «>e HS^V-X 

.r... or an — ^^^o irao;r ^.1^ 
in that it ccprxsa. at °" ^^^^^^ 

.»iaoted troB the group consisting ot a proo 
In ci::. XO or XX a poXypeptide as oXai»ea in any one 
. ... n to 29 I or an antibody as claimed in clai. 30. 
elaiBS X, to prophylactic or therapeutic 

position m particular for inhibiting the expression 
rrvlr^s associated with BuXtipXe sclerosis or 
rheum-toid arthritis, and/or the enzymatic activity of the 
rroteins of said virus, said composition comprising a 
nucleotide fragment of any one of claims 7 to 

33. Diagnostic. prophylactic or therapeutic 
composition comprising a polypeptide of any one of claims 
14 to 29, or an antibody of claim 30. 

3, process for detecting a virus associated 
„ith multiple sclerosis or rheumatoid » 
> biological sample, characterized in that an lam and/or a 
' presumed to belong or originating from said virus or 

their complementary RHA and/or DHA, is/are brought into 
Tontlct with a nucleotide fragment according to any one of 



Claim 7 to 9. ^^^^^^^ ^^^^^^^^ 
" ■ virus associated with multiple sclerosis or 



35. 

exposure to a 



exposure to a V ^a.ple, wherein said 

rheumatoid arthritis, n ^^^^^^^ ^^^^ ^ poXyeptide, 
sample is brought i antibody of 

according to any one of claim 14 to 23. 

35 claim 30. 
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G I A L i ^ - 8S 
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V « « H L f C <l * , \ T - A S S H 
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TGrCCTTCGCT 
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S^AAGGAAATTGATGrAGrGGCAAAGGGrrr 
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Cn-GCrGATTGGTGTATTTACAATCCT^^ 
CAATATGGAAAGAAAGGGACnjC^^^^ 

^^ccacaagtaaatcatggagtta™^^^ 

^^^CAACC^^mCCTA^^^^ 



SAGGATTlWCAOACAOTrrOCAAO 

??ATACCAACCrCTGGAGTTGGGC/^CA^^^ 

^^??GAOAGarGAA^^^^^ 
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SEQ ID NO 52 (JLBcl) ^^.^^r^. 
CCTCTATACTGTAATCAAGGAGACTCACO^^^^^^^^^ 



rATGCGAACTAGGa^jC^^^^ 
GTACAATCTCCAGCTITAAGCCTOC^^ 
ATCACAGAGAGGGCAG^AT^GCT^ 
ACTACCCCACAACCACTWCACJ^CTA^^^ 

AAAAGGCTGGCCTCACTGTrTATGGGTAGCrtJruo luvj i« 



AAAAGGCTC^C^^^--"-^^^^^ 

?J?i?5A^A™^r^'^^^ 

CAACCACCTGCTTAGATACCAC 



CrCATGATGTAATGGCATACTAGGTC^^^ 
CAACCACCTGC^G^CCA^^^ 



cc™gca™ccaa^^^^ 

GATGATCTCITAGA^rcCr^^^^ 

GAGGAGGAGACAGAGAGGAAGAGACAGAG^C^ 
AGACAGAGAGAGGAAGAGACAGACAGAAAG^ 

GAGGAAGAGACCAAGGACTTCNAG^ 
AGAAAAAAt^TT^A^C^^OTTAJ^ 
TATAATTOATAATrGAGnCrrcC^^ 
AAACCCTCAACCGAIVJG^^ 

ccAGCCAATACCCCcrrorr^^ 

AGGGAGACCTCTGACAATTCATA^^ 

CAGGTmCTAA/UVG^^^^^^^^ 
AAACCAGATCTAGGAGGAACTTOTCA^^^^ 




AGAA- 
GAGGTITTA 



CAACTAAAGTAA/aTITACTAA^ 
CAGTGACTCTC 
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ACrarcAACAAATTAGAGrrcCAG/^^ 

^^ATACAGATAGCAi«3I^^^^^aO^CCCCCATrA 
oX^AOOiOAAAGACAGAAAGTCAWO*^^ 



AAAffrGAGAAAGAGAGATCKS/^TACTA^^ 

iS^AAAAGCCAGGGTAAA™ 
^:SSa:ATAACCCTATJ^C^AAAgA^ 

T^t,*«r^ATAGCCCAAAAGCACTGAGGCCAOiUA^^^^^^^^^^ 



SCTAATATrTGGTCTGCTCAAATGTC^^^^^^^ 

?SIii?TACrrACAG/^AGG/^^^^ 
SiATCAGCTGAACAAGTrCTTATTA^^^^^^ 

AA^CAAAGrmCA^Aj^^^^^^^ 
ArA-TOTATTATCCTAACrrCTAATCTTCTGOA^^ 

CC^^CTGAAGTCCATC/^^AT^^^^ 

^^Sagggttaggaatggccac^^^^ 



AC AAO AAA 1 '^Xv-rr 

ggcagcagtgactctc 
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X TTCCTCACTT CT^CTAA CCTCAAAT<;A OAOA.GTGCC CCCATAACTC ^^^^^^ 
ei GTn^^CAT CCC^TATC TC^AGCJ ATC^^J^ .CACCAATC 

121 A^K^ATTCCCC ACAGGCCAGC ^ScT^ G1<5CTAGAAG GACTAAGGAA 

181 AGAACATGGA GATTC3GTGCC GCAGACATTT ^ cACAGGGGAA AGGAAGAAAA 

241 AACTAGGAAG ATATGAATTA ^TCAATG^ AAGCATACCA GGCAAGTGGA 

301 TCCTACT^C rrrCTGGAGA GACTAAGQGA atcTCTAATA GGGCTTGCTT 

361 CATTCGAGGC TCTGGAAAAG GGAAAAGTTG atAGAAATAA GCCACCACCT 

421 CCAGTGTGGT CTACAAGGAC ACTTT*^ ^USS ACTGCCCCAG GGGA-«5AAGG 
481 CG.CCA1XXC CCTTATGICA J^^^^ ^^Sc AGGACK^AGG GTGCCCGGGG 
541 TCCTCTGAGT CAGAAGCCAC TAACCAGATG ^TCCA ^TCAC CATTGAGGGT 

601 CAAGCGCCAG CCCA1«:CAT CACCC^A oTCTTACTTT CCOX^CTGG 

661 CAGAAGGGTA CTGTCTCCTC GACACTG^ SgSSa GGACAGCCAG TCACTAGATA 
721 ACAACI^TCC '^^'^ ^^S^S S^SS TTCCACATCC TTTTCTAATT 
781 CTTCTCCCAG CCACTAAGTT GTGACTG^ ^tcAGACATT CTAGCAAAAG CAGGGGCCAT 
Ifl ATGC<n«AAA GCCCCACTCT CTT^^ ScClX^ AGGAAGGAAT 

901 TATACA-TCTG AATATAGGAG J*^^^ SgACAA GCAAAGAATC CCCGTCCTGT 
961 TAATCCTGAA GTCCGGGCAA J^^^ ^CCtScAA AGGCAGTACC CCCTCAGACC 
1021 TCAAGTTAAA CTAAAGGATT CCA^^^ S^GGACCTA AAAGCCCAAG GCCTAGTAAA 
ir/i SSSSS SSSSS r^rx -AGGAG^A AGGAAACCCA ACGGAC 
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ACTCTCARTTCTTGTTTCCCTTTSCCTll ^^^^^^^^^^^^^^ 
CAGGCATTAGCCCAA 



Leu-Phe-Gly-Gln-Ala-Leu-Ala-Gln 

SEQ ID HO 39 (POL2B) 
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OVIVOEBI 
VOOdlHdS 

diHdsqad 




dsaHdOQd 
saudoqdi 
audoodiA 
adoodiAi 

dOOdlAiJ^A 
OOdlMMl 
OdlAlVyVLl 

IMllOS-Ld 

XlbsidNl 
lOSldNld 
OSidNldO 

sxdiMidqa 
JdNidqad 
dNidqadv 

QdUAdJOd 

dHAdlOdd 
BAdlOddV 

AdlOddVQ 
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DVlVOOdl 

VlVOOdlH 
IVDOdlHd 
VOOdlHdS 
OOdlHdSq 

diHdsaud 
nHdsaadO 
Hdsaydoq 
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osadUAdi 
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QdUAdlOd 
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HAdlOddV 

Adioddva 
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Gln-Ala-Leu-Ala 
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p IQj 38 AAQGAAZOC AGAAZOXAA SO 
Q KETQ KAN THL VRWT PEA 
RKLRKPIPI. .DGHQKQ 
GNS ESQ YPFS KMD TRS 

Aa^AQC3^QCr TICCAGGCDC TftAAGAAAIC CCTAADCJCAA GCIOQCaGIGr 100 
EAA FQAL KKS LTQ APVL 
KQL SRP .RNP .PK PQC 
RSSF PGP KEI PNPS PSV 

TAAQCTIQCC AAOOOOQCAA GftLTiTiUiT TM3ffiI[C3^ ASAAAMC^G 150 

SLP TGQ DFSL YVT EKQ 
.ACQ RGK TFL YMSQ KNR 
KLA NGAR LFF ICH RKTG 

G^^AEftOCTCr AQGAGKXnT iOOGGICC MGOGACMG CTIGCAACCT 200 
E.L. ESL HRS KGQA CNL 
NSS RSPY TGP RDK LATC 
lAL GVL TQVQ GTS LQP 

Gia3C3ffiADC TGAGTI^^ 2S0 
WHT .VRKLMXWQRVGLI 
GIP E.G N.CX GKG LAS 
VAYL SKE TDV XAKG WPH 

TIGmSOG GTRGQCXaGC AGIMCAGTC TiaGITKnG AMCAGITAA 300 

VYR .GS SSSL SF. NS. 
LFTG RAA VAV LVSE TVK 
CLQ VGQQ .QS .FL KQLK 

AftOMaSOG QGAftC^GftTC TISOGIGIG GACfiTCTCAr GATOIGAAOG 350 
NNTG KRS YCV DIS. CER 
IIQ GRDL TVW TSH DVNG 
.YR EEI LLCG HLM M.T 

GC3VEACnX2^ TQCTAAZ^^ 400 
HTH C-RG LVA VRQ PFT, 
ILT AKE DLWL SDN HLL 
AYSL LKR TCG CQTT lYL. 

AAAEAGCftGG TTCTATIXO' TCAftGflQOCA GIGCIQGGaC T3CftCmTC 450 

lAG SIT .SAS AAT AHL 
K.QV LLL EVP VLRL HIC 
NSR FYYL KCQ CCD CTFV 

TOGAMTCTT AADCXmrA CMTI^^ 500 
CNS. PSHISSRQ.RKDR 
ATL NPAT FLP DNE EKIE 
QLL TQP HFFQ TMK KR. 
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1^^A'^ff7fi9Q l?^^'^fi7Rqn 1234 Rfi7ftqn 12345fi7R90 1234567890 

F1G38 A20aftftCTC TCAACRftOm 550 
' l T.LSTSNCSNLCCSRGP 

^ HNCQQVIAQTYAARGD 
NITV NK. LLK PML'L EGT 

COTCnJtfS^ TIOXTIGRC TGR!^^ ^ 

SRG SLD .SRP QLV Y.W 
LLEV PLT DPD LNLY TDG 
F.R FP.L IPT STC ILME 

ASQTICXnTC QC3«3AAAAAG GAC^^ 650 
KFLG RKR TLK SGVC SDQ 
SSL AEKG L.K AGY AVIS 
VPW QKK DFEK RGM Q.S 

GIGftE^MQG AftaacriGRA AGERMOQCE TCRCKXaOG AftCraGTOCT 700 
WNT.K .SPHSRN.CS 
DNG ILE SNRL TPG TSA 
VIME YLK VIA SLQE LVL 

CROCTOXM A202«!iaGC (XTCZC^ 750 

PGR TNS PHLG TRI RRR 
HLAE LIA LTW ALEL GEG 
TWQ N. .P SLG H.N .EKE 

K K G K - Y I F ....S...k. . V C L P •§ P P 
KRV NIYS DSK YAY LVLH 
KG. lYI QTLS MLT .SS 

^SKmsmX AGCAJaSOOS ASfiGfiGB^ 850 
CPC SNME REG IPN F.GN 
AHA AIW RERB FLT SEG 
MPMQ QYG ERG NS.L LRB 

ACfiDCIMGA ADCATCAGGG MQOCSOTaG GaGflTraTTR. THAA;i'^l!ftC 900 
TYQPSGKPLGDYYWLY 

TPIN HQG SH. EIII GCT 
HLS TIRE AIR RLL LAVQ 

iCAAfiDCTAA imSGIGQCA GIC^^ *0 
RNLK RWQ SYT ARVI RKK 
ET RGGS LTL PGS SGRR 
KPK EVA VLHC QGH QEE 

GfiGGftftftOOG AAKEfiGAfiGG CAftTOaCJCAA GOSGKEKnG MGCAAftAftA 1000 
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CAATATGGAC AAGCAAAGAA IXSCCCSICCT GTICAAGITA AACTAAAOGA 



CAAXAOX^GAC AAGCAAAGAA TGCCOGlXX:r CTTrCAAGTZA AACTAAAOGA 
TTCTBOCTCC TnOCTTACC AAAGGIUUSTA OOCIGXTAGA OCGCSAOGCXX: 



TIGTQCCTOC TTICGCTACC AAAGSMaZA OCCXCmaA C90CGMXaOOC 
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TACAAOQlOflC TCAAAAGATT GITAAGQAOC TAAAAGOCCA AOQCCXAOEA 
AAACCASGCA GXAGGOCCTG CAAXACTQCA KTrrHRBSMi TAAGGSIAAOC 
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rnTinrTirnflr TrTar""^* V^nnnrrvsA occaaoccoc 

jtfSICTICAGC TTQOCAACAG GQCAAGATTT ■nCTTTATAT GOCRCAGAAA 
AOICTICAGC TTSOCAACAO GOCAAGWTTT TTCTTTATAT GCCRCAGAAA 
AAACAGGAAT AGCTCTRQGA GTCCTTACGC AOgTCTCAGg GaTOAGCTTG 
AAACAGGAAT AQCTCTAGGA tflXX?i TA CG C AQCJICTCftGO GATaAGCTTG 
CAACCOGIX» TATAOCTGAG TAAGGAAATT GATGTAGTGG CAAAwtfxAU 
CAACCOGIOG TATACCTQAG TAAGGAAATT GATGTAGrTGG CAAAGGGrTPG 
GOCTCATTOT TEATO30TAA TGGCGQCAGT AGCAGrTCTTA GTATCTCAAG 
GOCICAITOP TTOTOOGTRA TOQCOGCAGT AGCAGTCITA GIMCTGAAG 
CftOTEAAAAT AAflaCAGQGA AGAGATCTEA CIOTSTGGAC ^f^^f^ 
CACTTAAAAT AAXACftOGGA AOAaATCTTA CXGTOUQGAC AJPCTCATQAT 
GTOAACGGCA TACTCRCTQC TAAAQGMftC TTGTOGTTCJr CRCACAAOCA 
OTGAACGGCA TACTCACIXSC TAAAGGAGAC TIXSfSQSmr CAGACAAOCA 
-nraCTTAAT WPCftGQCTC TMTOCTTGA AGAQCCAGTO CTQAGACTOC 
TTEACTTAAT TAaCftQGCTC TftTEACTIGA AGAflOCftGrTG CTOApACIGC 
GCACTTOTGC AACTCTTAAA CCOQCCACAT TICTT0CA6A ^^^^^^ 
GCACITOIGC AACTCTTAAA CCOGCCACAT TTCTPCCAGA CAATOAAOAA 
AAQATAOAAC ATAACTGTCA ACAAGTAATT GCTCAAACCT ATCeiHiCTaO 
AAGATAGAAC ATAACICTCA ACAACrEAATT GCTCAAACCT ATGCTQCTCG 
jUGGGGACCrr CTAGRGOTIC CCTTOACTSA TOOCGAOCTC AACTT^ 
AQQaOACCTT CTRGROGTTC CCTTOACTCSA TOCCGACCTC AACTTOTATA 
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AOCAOOOQCC A3TASACA0C IGAACMAGQ AAAAOSAATA COCZOTrSCT 
AGCAGGGGOC AnSUEACACC TQAACATAQG AAAAOCSAATA GOCMnTSQCT 
GTCCCCTQCT IGAOGAAOGA ATXAATCCXG AAGTICTOGGC AASSUSAAOGA 
GaC O CCroCT TGAGGAAQGA AmASOCIG AAGTCIGQCX: AAiXAGAAOQA 
CAATATGOAC AAGCAAAGAA TCCCCGTCCT GITCAASITA AACTAAAG6A 
CAAXATOGAC AAGCAAAOAA TQCCOGTCCT GRITCAAOrEA AACIAAAGGA 
T TCTG O CTCC ITICOCTACC AAAflOAAGTA GOCZCmGA CCOQAaSOOC 
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ATCCTTIGAA CCCAAOCXCT CAACTCACCT GCaACTOTm ADCCCAAOGG 
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TTCAGOQATA GCCCCCATCT ATTIXXSOCAG GCATTAGCCC 



AJjaA CTIG AG 

— sAcrrsAG 

A»3AC3S!a&C 



650 
6 

650 



MSBV pol 

cons AIM 41,42,43 
Consensus 



SrqrcA tacctqgaca 



:CA TAOCIOQACA 



ICTIGTCCT TC> 
^rCTTOflCCT TCE 



VPS GATQATTTAC 



700 
58 

700 



MSRV pol 

cons ACM 41,42,43 



CC;nCAGAA ACCTTSTOCC AICAAGOCAC 
CCeiTCAGAA AOCTTOTQCC A3!CAA00CAC 



gamrafat apt 



CCAAG^ACZC 
QCAAG : PJJIC 



750 
108 

750 



MSRV pol 

cons AXXI 41,42,43 



TtapIAOCTS TSGCTACAAU (JmUIAAAC CAAA GGCK I 800 
1 K CTAOCTO T8QCTACAAG GTTTOCAAAC CAAAOGCIC \ 158 
fA^dmmn trsnr^m^^- f^nwrninr^^nn raaaflOTgfe 800 
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Msmr poi 

cons AEN 41*42,43 



KSHV pol 

oms A£N 41,42,43 
Consensus 



MSBV pol 

cons Am 41*42.43 
Consensus 



3CTCTOC1C\ CAG : ft ^iifnA [y.TACTIAGS GCEAAAftTia. TCCRAAQOCA 
SCTCTtaCXCA GAG : AC ; ITA 5 ATaCTTRGG GCTAAAATIA TCCAAAGGCA 
yTV-Trif-irA ramanWlTa Wa'rom'rBnr; <y^RftftftfTTT^ 'TV>rfci^Bryy& 



SI 



XCr CAGTCAOGAA aSTATCXAGC CTATACTOS: ^5tcS5H 
rCCT CAGTOAGC3AA OGIATOCAGC CI31TACTGG3 ITMCC1CA3C 

xrr rafmr^ftfys aa rrrrnTnrAfy^ rrftT acaxacE Unifw^iia- 



CXXAAAACCC TAAAflCAACT AAaASsnC CTSQGCKCMl CAGQTTX^ 
CXX3VAAACCC TAAAOCAACT AA^AGFpTIC CTTGGCATAA CAGOmCTG 



850 
208 

850 

900 
258 

900 

950 
306 

950 



MSRV pel 

cons Am 41*42,43 
Consensus 



0 3 ATOXCA GGT 

ijxcs A'ncocAGGrr i{ 



AGCCAGACCA. TTAf 
AGCCAGAOC^ TEA V 



1000 
358 

1000 



MSBV pol 

cons ACN 41,42,43 
Consensus 



1 AMTA : OGA. AACTCA : AAA GCCAfe 
C AAZTA^ 3GA AACICA^ AAA GCCAE 



ATETAOXAAG AXG8ACAC 
p ATTOAGXAAG AIGGACaL 



1050 
408 

1050 



pol 

cons AGN 41,42.43 




TQGCITECCA OQCOCTAAAO UtfSGOOCZAA CCCAAOOOCC 
TGXXnnCCA gqoocsaaag 



1097 
438 

IVAUUOOCIiAA CCC A AQ O OOC 1100 



HSRV pol 

cona AEN 41,42.43 



HSFV pol 

cons AEN 41,42,43 



usmr pol 

cons AEN 41.42,43 
Consensus 



HSRV pol 

cons AtXI 41.42.43 



USRVpol 

COBESS ACN 41.42.43 



HSRV pol 

cans AX» 41.42.43 



HSHV pol 

cons AZ3H 41,42,43 



MSBV pol 

cons ADN 41,42,43 



HSPV pol 

cons Am 41,42.43 



HSFV pol 

cons Am 41,42,43 



AGVSnCAOC TIGCCAACA6 GQCAAGATTT TitTA ' IA TftT GOCACAGAAA 
AOSSXTCAOC TTOCCAACAG GGCAAGATIT TICTITAXAT GCCACACSAAA 
AAACAGGAAT AOCICXAOCSA GICCTEACCX: AflglCIC A GG OATOAaCTTG 



AAACAOCSAAT AGCICEAGOA GTCdTACQC AGSICICAOQ GAXGAGCITQ 
CAAOXXSIUU TASSUSCZGAG TAAOQAAKXT GASCEAOIOS CAAAOOQITS 
CAAOC0GnDG» TftXAOCIGAO TAAQGAAATT GAi M I A OaO Q CAAAOQOUG 
GOCICAXTST CTUDOQOCAA TOGOU G CAgT AOCAGTCTIA GXAXCTOAAG 

Gcncnancr tiwxuugxaa toooggcagt agcagictta gicaitcxgaag 

CAOnSAAAT AAaACAOQOA AQAGATCnA CWXOZUUAC AICZCAT8AT 



CAOHAAAAT AAT3UC3USGQA AGAGKICnA CXOTCITOGOC AZCIGAIZQAT 
GRtGAAOCaOCA TACTCACZGC TAAAGQAOAC ' I ' AWmiA ' iW CAGACAACCA 



GIGAACQC3CA TACICACZGC TAAAGQAOAC I ' ltfAUJl ' AliT CAOACAAOCA 
1TTACXTAAT TA1CAGGC1C TATIIACTiUA AQAOOCAGOO CIOACaACXOC 



TTTACrCAAT TATCAOQCIC TAXTACnOA AOAOOCAOIO CIGAOACTGC 
GCACTZOroC AACZCTIAAA CC03CCACAT TXCTZOCAGA CAA!IGAAGAA 
OCACITOZGC AACZCXTAAA CCOQCCACAT TTCTZCCAGA CAASGAAGAA 
AAGATAGAAC ATAACTGICA ACAAGTAATT GCTCAAACCT AOGCTOCICG 
AA£aASAGAAC ATAACIOZCA ACAAOTAATr OCICAAACCT ASGCIGCXOS 
AOOOGAOCTr CSAGAOSnC OCnCACIGA TO00QACC2C AACTTSXiAXA 
AaOOGAOCrr CXAOAGSnC CCnGACIOA TSCOOAOCIC AACnOXATA 



U47 
438 

1150 

U97 
438 

1200 

1247 
438 

1250 

1297 
438 

1300 

1347 
438 

1350 

1397 
438 

1400 



1447 
438 

1450 

1497 
438 

1500 

1547 
438 

1550 

1597 
438 

1600 
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Trans of msrv pol 
cons prot 41,42,43 
Consensus 

Trans of msrv pol 
cons prot 41,42,43 
Consensus 

Trans of MSRV pol 
cons prot 41,42,43 
Conseaasus 

Trans of MSRV pol 
cons prot 41,42,43 
Consensus 

Trans of MSRV pol 
cons prot 41,42,43 



fffi^ HHT^oK^ pxccm^a ™pk^ 



KFCSSPOfTP 



5^V^«3Q WRLVQDLRII NEAVFPLVPA 



VSSPniiLSL 



QZ/IWIVLPQG 



50 
SO 
100 
100 
ISO 
150 
200 
200 



Trans of MSRV pol 
cons prot 41,42,43 
Consensus 

^ans of MSRV pol 
cons prot 41,42,43 
Consensus 

Trans of MSRV pol 
cons prot 41,42,43 
Consensus 

Trans of MSRV pol 
cons prot 41,42,43 
Consensus 

Trans of msrv pol 
cons prot 41,42,43 
Consensus 

'^rans of MSRV pol 
cons prot 41,42,43 
Consensus 

Trans of msrv pol 
cons prot 41,42,43 
Consensus 

Trans of msrv pol 
cons prot 41,42,43 
Consensus 

Trans of MSRV pol 
cons prot 41,42,43 
Consensus 




siOQwsLa ancroiAicv vrovsatsm 




wwaAwvLV SEMfflanjaR mnmsm; 



mnawsax. WLsamuiK EMLmmn 

Jlil^ ^^^l^" "«"»TLKP JOTLHWEEK 



^HSCQBm ^TfUmJU. EVfiMBBDU, UTTDOSSL^ 



RQUiKAGYKV 



i^??:^ KEPL^ XNHQEJORRL UAVOKPKEV AVUiCQOH,, 



™™ QADIEAKKftA RQDSPLEMLI BX3P 



400 

136 

400 

450 
140 

450 

500 
146 

500 

550 
146 

550 

600 
146 

600 

650 
146 

650 



683 
146 

663 
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cons ATN 41«42«43 
cons 1.-5,8 

Consensus 

cons AIS« 41,42,43 
cons AON 1«5.8 

Consensus 

cons AEN 41*42,43 
cons AEN 1,5,8 

Consensiis 

cons AEN 41,42,43 
cons AEI3 1 , 5 « 8 

Consensus 

cons AEN 41,42,43 
cons AEN 1,5.8 
Consensus 

cons ADN 41.42,43 
cons AEN 1#5,8 

Consensus 

cons AEN 41,42,43 
cons AEN 1,5,8 

Consensus 



cons AEN 41,42,43 
cons AEN 1,5,6 

Consensus 

cons AEN 41,42,43 
cons AEN 1, S, 8 

Consensus 



F/Gj 55 a 



1 



kSACTTOAGCC AGTTC 






Igacttgagcx: agtc 
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50 
50 

50 

100 
100 

100 



ATCCICATCC 
MCCiCAlCC 



CA B AACC : TA 
CA^ AACC ylA 



ATrrTirftTrr rft>'h^^^>-»^T^ nnnrj^nrT^^ 



AAGCAACTAA 
ARGCAACTAA 



^sdsTTCCT togcataIfS 

: A ^ STTCCT TQGCATA « 2A 



ancTOCC 
3txrrcTGCc 

GBttCECSfiOCi 



FjAATATOGAT TCCC » 3 I TAC 
; AATATQGAT TCCC ^ GfjEAC 



kifxAAGQAAA CTCAftftAAGC CA S TACCCAT 
CTCA : AAAGC CAffTACCCAT 




irAGTAAGAT 
arAGTAAGAT 




OCmtJ C A GG CCCTAAAC 
(XOTTCCAOG CCCTAAAG 



150 
141 

150 



200 
191 

200 



250 
241 

250 



300 
291 

300 




400 
391 

400 



438 
429 

438 



F((q 35 B 



cons prot 41,42,43 
cons prot 1,5,8 

Consensus 

cons prot: 41,42,43 
cons prot: 1,5,8 

Consensus 

cons prot 41,42,43 
cons prot 1, 5,8 

Consensus 




KAQLCSQQVK YUGLKLSK : T 
KAQLCSQQVK YUSLKLSK y T 



QATQALU3FL 
QATQALLNFL 






JVBWTP^EV AFQAU? 
I VKWTPE I EV AFQAIiC 



50 
47 

50 



100 
97 

100 



146 
143 

146 
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